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ABSTRACT 


The Stylist is a Waterloo Pascal program that analyzes the style of English prose. 


A “stvle checker”, The Stvlist pertains to Computational Stylistics and Computer 





Assisted Composition Instruction (CACI). The Stylist creates an affective model of 
the text based upon the following characteristics of its component words : etymology, 
tangibilitv, difficulty, emotional connotation and vigor. The Stylist then compares this 
model to the standards of fiction or nonfiction texts and reports results and 
reconimendations to the user. 

The Stylist also creates a concordance of the user’s input text using a new data 
structure called a Concordance Search Tree (CST). A CST is a binary search tree with 
a linked list threaded through it recording the order of the use of each word. An 
inorder traversal of the tree, with a traversal of the linked list during each visit, creates 
a concordance. 


This thesis also reviews related literature and programs. 
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I. INTRODUCTION 


A. BACKGROUND 
1. Computational Stylistics 

Computational Stylistics is the computer-assisted study of literary stvle. 
Computational Linguistics is the computer-assisted study of language itself. These 
disciplines are the automated subsets of Statistical (or Quantitative) Stylistics and 
Linguistics, which are the mathematical studies of style and language. All of these 
disciplines involve quantifying aspects of language and then manipulating these 
quantities in an attempt to gain insight into how and why the language works. 

The history of stylistics can be traced back to 1851, when Augustus de 
Morgan suggested that word-length could prove to be a distinctive trait of a writer's 
stvle. [Ref. 1] This suggestion prompted T.C. Mendenhall, an American geophvsicist, 
tO investigate whether mean word-length could resolve authorship problems such as 
those posed bv some of the disputed Shakespeare plays or the letters written under the 
pen name of Junius. Working in the late 19th century, Mendenhall analvzed the word- 
lengths of some two million words from various periods of English literature, using a 
primitive tabulating device that spit out reels of paper. His results, however, proved 
little. 

Poeerne “cajiy = 20in century, -anotner possible Characteristic of style, the 
frequency distribution of words, came under wide-spread investigation. G.K. Zipf 
postulated a “Rank-Frequency Law’, by which a ranking of the use of words in a text 
would show a constant decrease from the most-used word down to the least-used word. 
(Ref. 2] Other scholars, such as G. U. Yule, investigated such aspects as the richness of 
a writer's vocabulary and the length of sentences. (Ref. 3: pp. 363 - 390] By the 1950s 
and 1960s, the application of statistical methods to the study of literature had reached 
new heights of sophistication and complexity. Some scholars were investigating the 
significance of the ratio of verbs to adjectives and others were applving rigorous 
Statistical techniques. Despite the many fascinating insights offered by these lines of 
inquiry, their final significance and scientific credibility remained a question of much 
dispute. Enemies of the new disciplines included Norm Chomsky, the pioneer of 
formal languages, who argued that writing 1s a very human activity which involves a 


great deal of chance, and as such 1s not easily quantified. [Ref. 4] 


Computational stylistics began as a natural outgrowth of statistical stylistics. 
With the spread of computer access some thirty years ago, some scholars quickly 


realized the potential for their use in literary studies: 


“| first heard of computers in 1955 when mv wife (a mathematician) told me that 
she was going to work for an oil company as a program analyst... It took mea 
while to understand her flowcharts and computer programs, but when [ did, I 
realized that a computer could be used to solve other problems as well. Since 
then, | have used a computer for numerous applications relating to my work in 
the English Department of Cleveland State University, and have taught many 
others ... the power of electronic data processing in the study of language and 
literature.” [Ref. 5] 


The first challenge to the field of Computational Stylistics was to translate 
literary texts to electronic data. Much of the early literature of the field is devoted to 
this basic problem. After the first two decades, corpora, or bodies of literary texts in 
machine readable form, had been developed. The Lancaster-Oslo/Bergen (LOB) 
Corpus is a structured collection of 500 two-thousand-word texts of written British 
Enghsh. An American version of this effort is the Brown Corpus, constructed by 
Brown University! 

In addition, many literary classics have been painstakingly entered by 
kevboard. They are available from the Oxford Computing Laboratory at Oxford 
University and the American Philological Association. [Ref. 5] With the increasing 
capability and decreasing cost of optical readers, the problem of translating literary text 
to electronic data should become trivial. 

The usefulness of computers to. statistical stvlistics quickly became 
indisputable. Scholars used to labor to copy down on 3” X 5” cards each occurrence of 
every noun used in a Shakespeare play and then sort these cards by hand. [Ref. 6: pp. 
33 - 50] Now a computer program could manipulate the text in any number of ways 
within minutes. One of the other prominent successes of the use of computers in 
literary studies was the automation of the process of forming concordances. A 
concordance is an alphabetical listing of all the significant words used in a text, 
together with the enclosing phrase. Before the advent of computers, scholars 
laboriously built concordances of the Bible and the plays of Shakespeare. The power 


of the computer made possible giant concordance-building projects such as one for the 


‘Electronic forms of both of these corpora are available from: The Norwegian 
Computational Center for Humanistic Research, P.O. Box 53, University of Bergen 
N-5014, Bergen, Norway 
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Dead Sea Scrolls. The Centro Atomazione Analisi Linguista in Gallarate, Italy, used 
computers to build this concordance, which helped to resolve some of the missing or 
obliterated words. [Ref. 7] 

One of the next important challenges to computational linguistics was the 
issue of disambiguation. Many words have more than one meaning and can belong to 
more than one parts of speech. An example in English is the word “flies” in the 


following two sentences: 


Time flies like an arrow. 


Fruit flies like an apple. 


A human reader can easily decide that “flies” in the first sentence 1s a verb and 
that “flies” in the second sentence is a noun. Thuis decision 1s disambiguation. In order 
to compute such characteristics as verb-noun ratio, it is first necessary to disambiguate 
the parts of speech. Recent advances in computational linguistics have led to 
programs that can do this with more than 90% success. (Ref. 8] [Ref. 9: pp. 139 - 150] 
(Ref. 10] 

Scholars continue to argue about the usefulness of statistical stylistics. Efforts 
to discover and prove laws of distribution have not met with unchallenged success. 
Nevertheless, the work done in the past 130 years has laid some groundwork for the 
use of computers to study and analvze written prose. All of the professorial attempts 
thus have led to success at a humbler level : the tutorial. The history of stylistics is 
now culminating at this level, in the form of Computer Assisted Composition 
Instruction. 

2. Computer Assisted Composition Instruction (CACI) 

CACTI 1s a new discipline which has begun to emerge in the past decade. The 
success of word processing programs created a growing population of people who 
expected computers to help them to write. Enterprises such Bell Laboratories and 
centers of learning such as The Pennsvivania State University began to borrow from 
the field of stylistics as they developed computer programs that would help students 
and workers plan, write and edit prose. Several universities, such as Colorado State 
University, successfully instituted CACI as part of their composition curriculum. 


[Ref. 11] The earliest CACI programs required a mainframe or at least a minicomputer. 


With the expansion of the power of personal computers, however, software houses 
such as DecisionWare Inc. began to write CACI programs for that environment. Over 
10,000 copies of DecisionWare’s RightWriter are now in use at corporations, agencies 
and universities around the world, according to their advertisements. Given that word 
processing remains the single most common use of personal computers, as one 
megabyte of main storage becomes commonplace on new personal computers, stvle 
checkers such as RightWriter may soon become as popular as spell checkers have 


become in the past five vears. 


B. SCOPE OF THE STYEISE | 

The Stylist is a style checker akin to Bell Laboratory’s Writers Workbench and 
DecisionWare Inc.’s RightWriter. It does not help the user to plan or to compose his 
product. Instead, it analyzes the finished product and provides that analysis to the 
user for his consideration toward revision. The Stylist does not determine parts of 
speech, as does Writer's Workbench; nor does it suggest alternatives to hackneyed 
phrases, as does RightWniter. What the Stylist does instead 1s to analyze some of the 
characteristics of the words used in the text. By doing so, The Stylist can distinguish 
between short, vigorous, germanic, emotional texts and long, lazy latin texts. The 
success of this effort could and should be incorporated into more extensive programs 
such as Writer's Workbench. 

In addition, The Stylist solves the traditional problem of creating a concordance 
in a novel and elegant way. This solution is optimal for the personal computer 


environment of the coming few vears. 


Il. RESEARCH FOR THE STYLIST 


A. LITERATURE REVIEW 
When I first conceived of The Stylist, I believed that a “style checker” was a 
completely original idea. Little did I know that major universities and great 
coportations had been working on the problem for decades. My research began with 
the Encyclopedia Brittanica, where | discovered the existance of the fields of stylistics 
and statistical linguistics. Using these as subjects, I searched the Dewey decimal 
svstem and several automated data bases for titles. The books | discovered referenced 
the two journals which publish many of the pertinent articles : Computational 
Linguistics, and Computers and the Humanities. These, in turn, carried advertisements 
for some of the currently available software. They also identified the current centers of 
learning, some of which I contacted for guidance and information. The experience of 
researching the field demoted me from its inventor to its lowliest tyro. It also subjected 
me to many ideas, some of which I incorporated into The Stylist. The bibliography 
lists the sources which I unearthed which may benefit other students. In this 
discussion, I will cover only those articles which directly influenced the development of 
The Stylist. 
1. Affective Tone 

C.W. Anderson and G.E. McMaster reported on a program called PSA 
(Psychological Semantic Analvsis) which analyzes the emotional tone of a text. 
(Ref. 12] They built upon the work of D.R. Heise, who ascribed values of “Evaluation, 
Activity and Potency” to the thousand most commonly-used words in the English 
language. [Ref. 13] PSA allows the user to enter his text one line at a time; if anv word 
is ambiguous, PSA asks the user to disambiguate. PSA then matches these words to 
the 1000 Heise-word dictionary, adding values up to create a profile of the user’s text. 
Table | helps clarify the meanings of the three Heise-word categories. 

After building and testing PSA, Anderson and McMaster concluded that: 
“The affective tone of whole passages can be measured by computer-assisted collection 
of... scores of those words ... for which Heise (1965) has provided semantic 


meanings.” {Ref. 12] 


He 


eee 


SR MR peers ae 


TABLE 1 
HEISE WORDS VARS 


| EVALUATION 
Hacgit Neutyael Low 
Enucen experiment war 
God DEOVie fire 
beauty mountain disease 
pleasant suggest ba 
ACTIV iy 
| High Neutral Low 
) fire well dead 
| great Know Silent 
attack Basie sleep 
ES iice presence rock 
POTENCY 
High Neutral Low 
steel hear love 
LGOr Cle kiss 
| rock all baby 
hard indicate wife 


The original idea of The Stylist envisioned doing just such a count of words 
chosen for their frequency of use and their characteristics such as etymology, emotional 
connotation and vigor. Anderson and McMaster convinced me that this method had a 
sound psychological base. Moreoever, Heise’s categories of Evaluation and Activity 
seemed to correspond directly to my envisioned categories of emotional connotation 
and vigor. His category of Potency, however, struck me and continues to strike me as 
distressingly Freudian. Because The Stylist would not use that category and because it 
would use others, I decided not to borrow Heise’s words and values, but rather to build 
my own dictionary and ascribe mv own values. This seemed a reasonable approach, 
given the statement by Anderson and McMasters that “there is much commonality in 
the emotional response of different persons to words and objects.” 

2. Readability and Sentence and Word Length 

Two of the most common stylistic measures are sentence length and word 
length. The most common use of these measures is to determine readabilitv. The basic 
idea is that short sentences are easv to read and long sentences are hard to read. 


Various formulae, such as the Kincaid, the Automated Readability Index (ARI), the 
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Coleman-Liau, and the Flesch Reading Ease Score, [Ref. 10] attempt to determine 
readability as a straight-forward function of sentence and word length. These formulae 
have had considerable impact on the teaching of English. Most style checkers use 
then. Some, Itke RightWriter, admonishes the writer any time a sentence grows 
beyond 22 words or so. Hflaving read a great deal of Victorian novels and Madison 
Avenue copy, Im well aware that the ideal 20th century sentence is short. It’s 
vigorous. Izasy to understand. Maybe even dispenses with its verb as it rushes toward 
its punctuation! The mechanization of this modern tendency into simphisuc formulae, 
however, seems Co me pernicious. A sentence should be the unit of a complete idea. If 
We limit ourselves to short sentences, we may be limiting ourselves to small ideas. The 
average sentence of the prose of the Age of Reason was 45 words. If Samuel Johnson 
were to live today, no one would let him finish a sentence. My distrust of these 
formulae redoubled when [ read “Readability is a Four-Letter Word,” by Jack Selzer. 
(Ref. 14] In this article, Selzer forcefully makes the points that readability is a 
subjective quality which is greatly influenced by factors such as arrangement of ideas, 
reader background and interest, and difficulty of vocabulary. The Stylist does not use 
readability formulae. It does flag sentences which seem to be run-on. More 
importantly, tt gauges the difficulty of the vocabulary. If the writer uses hard words, 
long words, latinate words and long sentences, The Stylist warns him, particularly if the 


work ts fiction. 


B. PRODUCT TESTING 
!. RiehtWriter 

As part of my research, I procured a copy of RightWriter” and tested it with 
some of my own writing. Overall, the program impressed me with its capabilities and 
is engineering. RightWriter reproduces the users text with inserted comments. Its 
constant challenge of long or complex sentences forced me to recxanune each case in 
particular. Several lengthy sentences became two short ones. It never let me begin a 
sentence with “But,” a foible of mine. It applauded my writing when it was strong and 
i derided it when it was pompous. One attractive feature was the production of a 
alphabetical word list with frequency of occurrence. This feature demonstrated my 


overfondness for the words, “ancient” and “thousand”. RightWriter also suggested 


Right Writer (tin) is a product of Decisionware Inc., 2033 Wood St., Suite 218, 
Semasota, I]. 33577, (813) 952-9211. 
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substitutes for hackneyed or useless phrases such as ‘the fact that’. Overall 
RightWriter proved itself extensively useful. 


2. PC-Style 

PC-Style? is a much less ambitious program than RightWriter. PC-Style 
required only JOK of RAM, compared to 192K for RightWriter. PC-Style is also much 
cheaper, costing only $29.95 as compared to RightWriter’s $95. I tested it with the 
saine test data set [ had used on RightWriter. PC-Style has a nice human-factor 
feature, in that it constantly displays to the user how much it has done and how far it 
is from finishing. RightWriter is more a coffce-break program : you execute it and 
then you go make some collee. PC-Stvle, however, had little to recommend itself 
besides this leature. It relics upon a readability formula. It also attempts some 
affective modeling, based upon its dictionary of 50 action verbs. This miniscule 
dictionary 1s inadequate for the task. A match of less than two percent of my input 
text with these 50 words was typical; such a small sample is inadequate to qualify the 
vigor of a passage. Another simplistic but more valuable tactic of PC-Style was to 
count the frequency of “personal” words, such as “I”, “you” and “we”. Although the 
conventions of the more stufly forms of writing forbid them, it is generally accepted 
that most technical writing benefits from direct, personal pronouns. PC-Style 
reinforces the clarity and forcefulness of direct rhetoric. Despite these few nice 
features, PC-Style is a too simplistic to be of lasting utility. 

3. Writer’s Workbench 

Although [ was unable to experience Writers Workbench, [| did obtain 
enough research materials to form an impression of ts utulity. (Ref. 10] Writer's 
Workbench? is actually a complex of 32 programs. Together, these programs provide 
more than all the features of RightWriter. STYLE calculates readabilitv, using the 
previously-discussed formulae. It also analyzes sentence type (simple, complex or 
compound). SIYLE is able to disambiguate the words of the input text with 95% 
accuracy. It then analyzes the use of verbs and miodiliers. If a passage relies too 
heavily on the passive voice or it 1s fat with modifiers, STYLE warns the user. The 
follow-on program, DICTION, detects hackneyed phrases. SUGGEST suggests 


replacements. Overall, Writers Workbench appears to be the industry standard. 


*PC-Style (tm) is a product of ButtonWare Inc., P.O. Box 5786, Bellevue, WA 
98006, (206) 454-0479. 


+Writer’s Workbench (tm) is a product of Bell Laboratories, Murray Hill, New 
Jersy, 07974. 
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C. CONCLUSIONS BASED UPON RESEARCH 

Now I knew that I would not be writing the first style checker. Nor would I be 
writing the second, third, fourth or fifth. Some solace could be found in the fact that 
the previous programs relieved me of the need to incorporate all stylistic features into 
The Stylist. I could extend the affective modeling of PSA by including other values of 
words. Three of these new values could be more objective--etymology, difficulty and 
tangibility. Together with vigor and emotional connotation, these values should be 
able to describe a profile of the user’s input text. 

Additionally, | could increase the accuracy of such style checkers by pointing out 
the need to make allowances for the genre of the user’s input text. Any reader of 
technical as well as fictional writing knows that the characteristics of these two tvpes 
often vary more widely than the style of writers within the tvpe. Comparative stvle 


analysis, therefore, would clearly seem to need to take the genre into account. 


i 


[i. DESIGN OF THE Sitieiss 


A. CHARACTERISTICS OF WORDS 

The fundamental idea of The Stylist is that individual words have power. The 
denotation of a word is its meaning. The connotation of a word is its emotional 
impact. For example, “pupil”, “student”, and “scholar” all denote a person who studies 
and learns. The emotional connotations, however, range from the humble “pupil” 
through the familiar “student” to the lofty “scholar”. Besides these emotional 
connotations, words have other intrinsic values which can be quantified. The Stylist 
would concentrate on the following values: Etymology, Tangibility, Difficulty, 
Emotional Connotation and Vigor. 

1. Etymology 

One of the beauties of the English language is that its vocabulary embraces 

two main sources: native and borrowed. Our native words mainly come to us from the 
Anglo-Saxon tongues. Borrowed words come from Latin, mostly by way of the French 


of the Norman conquest. The following table illustrates the differences : 





VABUE 2 
EXAMPLES OF NATIVE AND BORROWED WORDS 


Native Words Borrowed Words 


Man Rencou 

Enger Paice 

Thinker Philosopher 
Fire Conflagration 
Book Volume 
Diveaag Miscreant 





Native words are short, strong and rough. Borrowed words tend toward 
length, gentility and elegance. Good English prose (particularly good fiction) favors 
native words. Poor English prose (particularly bad technical writing) exhibits a 
tendency to overutilize latinate etymology. By counting up the number of native and 
borrowed words in a user’s input text, The Stylist could see how it compares to good 


writing of the appropriate genre. 


2. Tangibility 
A word can either evoke an image of a thing or it can refer to an idea. I call 
the former, “tangible” and the latter, “intangible”. The following table illustrates the 


difference: 


RAB LE 
EXAMPLES OF TANGIBLE AND INTANGIBLE WORDS 


TANGIBLE INTANGIBLE 
R6GK aie aera 
aps. Automatically 
Beehive Then 
Corvette Rey 
Hammer Preliminary 


Tangible words are concrete, exact and evocative. Intangible words are ideal, 
general and cognitive. All writing uses both. Good writing usually takes advantage of 
tangible words. Even the most philosophical writing benefits from the use of tangible 
words. (See Appendix E for an analysis of a Platonic dialogue). Good fiction writing 
rarely strays too far toward the intangible. The Stylist could count the tangible and 
intangible words of the user’s input text and compare this count to good writing of the 
appropriate genre. 

3. Difficulty 

This is one of the most indisputable characteristics of words. For the 
purposes of The Stylist, I defined four levels of difficulty : Elementarv, High School, 
Graduate and Postgraduate. The words of each level are those most likely to be used 
in speech with ease by an average person of that educational level. Although I’m 
aware that reading, writing and speaking vocabularies are different, | contend that a 
person reading a word within his reading vocabulary but outside of his speaking 
vocabulary often must pause for a mental translation into more simple terms. The act 
of this pausing raises the difficulty level of the text. Table 4 illustrates the categories. 
These examples are based upon my judgements. 

By counting the occurrence of difficult words, The Stylist could determine the 


overall difficulty of the vocabulary and thus the readibilitv of the text. 
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TABLE 4 
EXAMPLES OF WORDS OF VARIOUS DIE El Gmina 


| 
ELEMENTARY HIGH SCHOOL GRADUATE POSTGRADUATE 


Bag ene Matra ae Execrable 
Sister Rendezvous Immaterial Parsimony 
spoon Transmission Tonality Recuyeaaae 
Ferenc Foreman Universal Homomorph 
Wish Process [Lnduceiion Dilateie, 
Done Joyous Processor Zyqgore 
Handsome Undergo Linear Synergy 





4. Emotional Connotations 


The Stylist would have five categories of emotional connotation : Sublime, 
Pleasant, Neutral, Unpleasant and Horrid. The following table illustrates the 


categories: 


TABIERSS 
EXAMPLES OF WORDS OF VARIOUS EMOTIONAL CONNOTATIONS 


| 
| 
| 
SUBLIME PLEASANT NEUTRAL UNPLEASANT HORRID 


Beauty Happy Lhe Damage Cancer 
Sun rae Hood Which hoss. 9 Murder 
ViLGCory Friendly Brick Insueing Whore 
Love Warm Is Loser Fuck 

i Goa~ 2. Helpful Name Ses Death 

| Paradise Sex When Wound Traiteu 


By counting the use of these types of words, The Stylist could determine the overall 
emotional tone of the passage. It could also detect flat or emotional writing. 
5. Vigor 

A related but distinct characteristic is vigor. Sublime and horrid words tend to 
be highly vigorous, but not all highly vigorous words are emotional. Examples of 
vigorous but unemotional words are “sprint”, “rush” and “cross”. This category also 
tends to be more objective. For example, the word, “soldier” can have widely different 
connotations for different people. The word “soldier” would please a career Army 
officer but it would displease a survivor of the Japanese occupation of Canton. Both 


could agree that “solider” 1s a vigorous word. 
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LAB ie 
Eevee LES OF WORDS OF VARIOUS VIGOR 


VIOLENT ENERGETIC CALM INERT 
Destroy Seas Read From 
Creation Dive. Write oe 
Fire Proce ine Manager something 
Atenie Wedding Ocean Comma 
Holocaust steam Blue Paper 
Conqueror Eby ing Source Format 


By calculating the vigor of the words of the user’s input text, The Stylist 


would be able to estimate its overall strength. 


B. THE NEED FOR A DICTIONARY 

To quantify the above five aspects of the words of the user’s input text, it would 
be necessary to maintain a dictionary of the most commonly used words and their 
values. The original idea of The Stylist forsaw just such a dictionary. After reading 
Anderson [Ref. 12] and testing PC-Style, the idea seemed less original but still valid. 
The heart of The Stylist, therefore, would be its dictionary. Before I began anv top- 
down designing, I first wanted to explore the technical challenges and possible pitfalls 
of building such a dictionary. A review of data structure literature convinced me that 
two approaches were the most feasible: hashing and Binarv Search Tree. Since I had 
already decided that The Stylist would also produce a concordance, I had to take into 
account the need for efficient alphabetical traversal of the words of the user’s input 
text. My intuition that hashing would be preferable for dictionary look-up but would 
not lend itself easily to concordance-building was confirmed by a passage in an 
excellent text by Donald E. Knuth. [Ref. 15: p. 540] Having decided to implement the 
dictionary with a Binary Search Tree (BST), I began to build a series of prototypes. 


C. THE COPYDIX PROTOTYPES 
CopyDixl through CopyDix6 were early experiments in using a Binarv Search 
Trees for dictionary lookup. Naturallv, the word was the kev value. The experience of 


the CopyDix series taught me the following lessons: 
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|. The Benefit of Preorder Storage 
The structure of a Binary Search Tree is maintained during execution as a 
system of pointers. These pointers refer to locations in memory which pertain only to 
that execution. [fa dictionary is modified during execution (for example, if new words 
are added), then the BST should be stored to its file in preorder. An inorder storage 
seems the most logical way but it is actually the worst. If a BST is stored using an 
inorder traversal, the next time it 1s loaded it will be that least bushy of all BSTs: a 
linked list. Storage using preorder will cause the next loading of the tree to duplicate 
the last. 
2. The Need for AVL 
Even with preorder storage, the dictionary needed the capability to change: to 
grow or shrink as the user desired. Any particular office or curriculum has its own 
special vocabulary. I wanted to give The Stylist the ability to adapt its dictionary to 
the vocabulary of its user's environment. Without some mechanism for rebalancing 
the BST after the insertion of new words or the deletion of unwanted words, the BST 
could become increasingly lopsided. Searching for words in such a lopsided tree would 
become inefficient. Clearly, an AVL scheme was required. I adapted an AVL insertion 
routine from an excellent text by Niklaus Wirth. [Ref. 16: pps. 220 - 221] 
3. Storage requirements 
The CopyDix series allowed me to see whether a BST of 5000 words and 
associated values would fit into the main storage available to a healthy personal 
computer. I determined that 1.5 megabytes would be sufficient for The Stylist and its 
data structures. The Copydix series allowed me to create such a structure and to prove 
that it would not require more than one megabyte. 
4. Text Processing 
The CopvDix series also identified some unforseen complexities in processing 
text. For example, in order to include contractions, it was necessary to include the 
single quotation mark (’) in the dictionary. Waterloo Pascal uses this character to 
define the beginning and end of user-defined constants and strings. To include 
contractions, it was necessary to declare “Succ(’@’)” instead of (’)! There were a few 
other similar problems requiring equally unhappy solutions. 
5. Building a Dictionary 
Finally, the CopyDix series allowed me to begin to build a dictionary of most 
frequently used words. I ran the text of two novels and several technical articles 


through the Copydix series, accumulating a dictionary of over 2000 words. 
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D. CONCORDANCE SEARCH TREE (CST) 
1. Original Concept 

Based upon the experience of the CopyDix series, [ began to play around with 
the various ways of building a concordance. The literature provided two examples of 
how the problem has already been solved. One solution was to create a search tree. 
The key value of the search tree was the word; two associated tables, TOKEN and 
TYPE, recorded the information about the sequence of occurrence of the words. 
[Ref. 7: pp. 186 - 191] Another solution familiar to most computer scientists was the 
Key Words in Context (K WIC) program designed by Parnas. [Ref. 17] This solution 
creates a KWIC listing by circular shifting each line and then sorting each line; such a 
KWIC listing is similar to a concordance. 

I wanted to create a concordance without relying upon any external storage. 
My experience with the CopvDix series had shown that text processing 1s both I,O and 
computation bound: I,O bound because large text files must be loaded; computation 
bound because each word must be processed character by character. [I wanted the 
solution to eliminate the need for further input or output (such as using secondary 
storage to build files containing KWIC lines). I also wanted to minimize computation 
by allowing The Stylist to remember the order of occurrence of each word of the input 
text, without having to recompute it. I wanted my solution to flow as naturally as 
possible from the process of looking each word up in the dictionary. I didn’t want to 
have to process each word twice, or to search for the position of any word twice. 
Ideally, the Stylist would have enough main storage available that it could retain all 
important information, without resorting to recomputation or to secondary storage. 
These ideas led me play to around with various ways of using a BST to record the 
order of occurrence of the words of the user’s input text. One obvious solution would 
be to have each node of the BST contain pointers to associated data structures such as 
an array of linked lists containing the sentences of the input text. Such a solution, 
however, would require storing the words twice. CopyDix had shown that words take 
up a great deal of storage space. Storing them twice would be wasteful. With these 
ideas in mind, IJ lit upon the idea of the Concordance Search Tree (CST). 

A CST is a Binary Search Tree with a linked list threaded through it. This 
linked list is two-way, connecting each word of the BST to the word used before it and 
to the word used after it. An inorder traversal of the BST visits each word in 


alphabetical order. During each visit, a traversal of the linked list in the “backwards” 


direction would encounter all the words used before that word. Printing these words, 
then printing the key word being visited, and finally printing the words in the 
“forwards” ‘direction creates a concordance. 

This solution offers some exciting facilities. It allows the user’s input text to 
be accessed in any number of ways. Ifa concordance of three, five, seven or thirteen 
words per line is desired, changing a few global constants immediately fulfills that 
desire. If only the sentences containing a certain word are desired, a insert search of 
the CST keving on that word, and then a concordance traversal on that node, provides 
all those sentences. In short, a CST is a complex structure with a challenging but 
strong intuitive appeal. It provides a great deal of flexibility to the processing of text. 

One added wrinkle of complexity lies in the fact that many words of the input 
text Occur more than once. The word, ‘the’, for example, occurs many times. To 
guarantee a correct concordance traversal of the CST, each occurrence of the word 
“the” must be associated with a unique pointer. This requirement is satisfied by 
creating a unique node of pointers associated with each occurrence of the word. The 
main node of the CST contains the word, its values, and left and right pointers. It also 
contains a pointer called “down”, which points to the linked list of unique nodes of 
pointers. The first such node pertains to the first occurrence of the word, “the”, in the 
users input text. [t points to the word used before and the word used after that 
occurrence of the word “the”. (These pointers are called “last” and “next’.) It also 
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points “down” to the second occurrence of the word “the”. So the Stylist creates a 
concordance through the following traversals: an inorder traversal of the CST visits 
each main node in alphabetical order. During that visit, The Stylist goes down the 
linked list of the occurrences of that word. During each step down the linked list, The 
Stylist traverses “lastward” and then “nextward” to print out the line of that 
occurrence. When The Stylist reaches the bottom of the downward list, 1t continues 
the inorder traversal. 
2. The Tril - Tri9 Prototypes 

To examine the feasibility of the CST concept, I wrote a series of progressively 
more capable prototypes called Tril through Tri9. (The names refer to a “[rinary 
Search Tree”.) I found that I could implement all the procedures necessary to build 
and traverse a CST in only 137 lines of code. Moreover, the performance of the 
concept, both in terms of main storage and processor requirements, seemed extremely 


satisfactory. 


E. DESIGN OF THE STYLIST 

Having used a series of prototypes to identifv and solve the critical issues, I set 
them aside and began to design The Stylist. After jotting down some logic flowcharts 
and data flow diagrams, I decomposed the problem into three main modules which I 
called “Reader”, “Researcher” and “Reporter”. 

1. The Reader Module 

Only Reader has access to the user’s input text. [t reads that text and passes 
along a data structure called “readnode”, which is merely the word and its length. The 
Reader also signals the ends of sentences and the end of the file. 

2. The Researcher Module 

Researcher receives the “readnodes”, researches the qualities of that word, 
builds a profile of the user’s input text based upon the tally of the qualities of those 
words, and then passes that profile on to the Reporter Module. If the user wants one, 
Researcher also creates the concordance. Only Researcher has access to the dictionary 
file and to the CST. Neither Reader nor Reporter know how the dictionary is 
implemented or how the concordance 1s produced. This is the most crucial instance of 
information hiding in the design. 

3. The Reporter Module 

Reporter receives the profile, which it compares to the profiles of other texts. 
Based upon this comparison, it passes on recommendations and commendations to the 
Wiser. 

4. Low Level Design 

Having decomposed the problem into these three modules, I jotted down a 
plan for the procedures that would compnse them. This plan included the 
input/output and operations of each procedure. This low level design included all the 
parameters of the procedures and identified the hierarchy between the procedures and 
modules. 

One of the important features built into this low level design was the human- 
factor facility of reporting to the user the progress of the execution of The Stylist. The 
need to send messages to the screen telling the user what was happening required 
counting steps and further input/output, but my experiences with RightWriter, PC- 
Style and the CopyDix prototypes had convinced me that the expense was worth it. 
Psychologically, staring at a blank screen for five minutes seems ten times as long as 


reading a dynamic screen for seven minutes. To provide the user the option of quicker 


but more boring execution over slower but more engaging execution, I designed a 


facility that asked him whether he wanted frequent reports, occasional reports or one 


report when done. 


With this low level design in hand, I was ready to begin to code. 
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TV. IMPLEMENTATION OF THE STYLIST 


A. CODING 

Because of the experience gained from the CopyDix prototypes and because of 
the top-down, modular design, coding the Reader and Researcher modules took only 
two weeks. Prototyping and top-down design allowed me to code quickly and with few 
errors. (The usefulness of these software engineering techniques would also prove itself 
in the testing phase.) As I had the CopyDix series, [ implemented The Stylist in a 
series of iterative steps called Stylel through Style9. 

Stvle6 brought to light light an unforseen problem: the interaction between the 
facilities of adding new words to the dictionary and creating a concordance. The 
design called for allowing the user to decide whether he wanted to add new words to 
the dictionary and whether he wanted a concordance. These two choices created four 
cases: Growing Dictionary and Concordance; Growing Dictionary and no 
Concordance; Static Dictionary and Concordance; Static Dictionary and no 
(Moncordance. in the case of Static Dictionary and Concordance, to create a 
concordance, new words found in the user’s input text would have to be added to the 
CST. But since the user had chosen not to add new words to the dictionary, The 
Stvlist could not prompt him for the values of these new words. This was not a 
problem until the same new word was used again. Researcher would find this new 
word in the CST but would not find anv values for the word. This would cause a fatal 
error: “Word.etvmology has unassigned value.” To avoid this problem, it was 
necessary to add a new field to each of the nodes: Status. The status of a node could 
be: “Valuable”, which meant that the node contained values for the other fields; 
“Not_Valuable”, which meant it didn’t. 

This particular problem grew even more gnarly when I decided to extend the 
Look_Up procedure of Researcher. The orginal design called for Look _Up to try to 
match the input word only with its exact equal in the dictionary. This worked fine, but 
it meant that the dictionary would have to contain all the following variants of “look” : 
“look”, “looks”, “looked”, and “looking”. The extension created a new procedure called 
Look_Up Variants. If the input word didn’t match any word in the dictionary, 


Researcher looked up variants of the word. If it found one, the values of the variant 
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were added to the profile. This extension allowed a more compact dictionary and it 
increased the probability that most words would be found, but it complicated the 
problem described in the previous paragraph. In the case of Growing Dictionary and 
Concordance, a word like “looked” would have to be added to the CST for the sake of 
a concordance, but now it would also be added to the dictionary. Look _up variants 
would thus eventually gum up the dictionary with a lot of useless variants. Solving the 
problem meant adding a third status. The three statuses were now: “Storable”, which 
meant the node was a first class citizen, a word which had values and should be stored 
to the dictionary; “Not_Storable”, which meant that the node was a second class 
citizen, a variant word which had values, but should not be stored to the dictionary; 
and “Not_Valuable”, which meant the word was a third class citizen, a new word which 
had no values, nor should be stored to the dictionary. 

The solution of these problems completed the coding phase of the Reader and 
Researcher modules. The Reporter module remained a stub. The Stylist could not 
make recommendations and commendations to the user until it had established its 


dictionary and built up a history of other profiles. 


B. BUILDING THE DICTIONARY 

The CopyDix series and Stylel through Style7 had built up a dictionary of some 
3000 words, which contained about 80% or 85% of the words of any given text. Each 
of these 3000 words, however, now required the assignation of five values : etvmology, 
tangibilitv, difficulty, emotional connotation and vigor. That totals to 15,000 values. 
After experimenting, I developed a system that allowed me to enter those 15.000 values 
in two weeks. First, I sorted the dictionary alphabetically and printed out a listing. 
Using Funk and Wagnall’s Collegiate dictionary, | checked the etymology and 
highlighted the borrowed words on the print-out. Then I wrote a program called 
AUTOMATE, which displayed a screen for each of the five values of each word and 
required one keystroke to assign a value. Using this system, I assigned values to all 
3000 words, going at a speed that relied upon a subjective reaction similar to the 
reaction of a reader scanning a text at a comfortable reading pace. Then I wrote two 
new programs, DIXSPLIT and DIXJOIN. DIXSPLIT split the dictionary into 17 
intersecting subsets : all native words, all borrowed words, all tangible words, all 
intangible words, and so on. Each of these subsets was like a formation of soldiers. 
Any word that didn’t belong in that subset stuck out conspicuously. After correcting 
these mistakes, I executed DIXNJOIN, which brought the amended dictionary together. 


Now I was ready to begin testing The Stylist. 
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V. TESTING OF THE STYLIST 


A. INITITAL TEST DATA SET 

The initial test data set for The Stylist included both fiction and technical writing. 
Fiction was represented by long passages from two of my novels and 1000-word 
excerpts from the novels of Ernest Hemingway, Raymond Chandler and Kurt 
Vonnegut. Technical writing was represented by about a dozen student essays and 
excerpts from three textbooks on computer science. In both categories, I included 
both good and poor writing. I ran all the texts through Stvle8, the Reporter module of 
which merely printed out the values of their profiles. Style8 performed robustlv. | 
executed it under all four cases pertaining to the dictionary and concordance as well as 
under all three cases pertaining to the frequency of execution status reports. Style8 
revealed no major flaws. Its shortcomings were well within the scope of the original 
design. (See Appendix A, Suggested Extensions to The Stylist). Style8 suggested some 
fine-tuning changes. 

Two changes dealt with the part of the CST scheme called the FourDix. The 
FourDix contains the most common function words, all of which are four letters or less 
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emieneth. Examples of FourDix words are “the”, “a, then’, “or’, “he’, “she”, and so 
on. The FourDix is a separate CST. Words of the user’s input text of four or less 
letters in length are first sought for in the FourDix. This is an economy measure, since 
a large portion of any text 1s made up of the function words found in the FourDix. 
Another savings of the FourDix 1s that the concordance is created by an inorder 
traversal of the main CST. This means that the functional words of the FourDix are 
included in the concordance only in the phrases embracing the more significant words. 
Otherwise, the concordance would have dozens or perhaps hundreds of lines showing 
each use of words like “the”. The first fine-tuning change suggested by Stvle8 dealt 
with the values of the words of the FourDix. These words had values, of course, but 
they tended to obscure the overall picture. Eliminating their values from the tally was 
the first solution. I modified this to adding only those values of personal pronouns, 
which are words of energetic vigor. This is similar to the approach of PC-Style. The 
second fine-tuning change was not adding the length of the FourDix words to the tallv 


of word lengths. With the FourDix words, a bar graph of word length usually had two 


29 


2 See 





humps : one tall one around three letters and another, shorter one near five letters. By 
elimunating the length of the FourDix words, both fiction and nonfiction writing always 
showed a bell-shaped curve. The bell of fiction writing tends to be a tall one centered 
on five letters/word; the bell of nonfiction writing tends to be a shorter one centered on 
six letters/word. All this means is that fiction writers use shorter words and words of 
more uniform length, whereas nonfiction writers use longer words and words of more 
variable length. As a stylistic measure, its use is limited. As a bar graph, it is a lot of 
fun. 

In any case, the profiles obtained allowed me to examine the true usefulness of 
The Stylist. Thankfully, The Stylist was able to distinguish between the novels of 
Ernest Hemingway and the ramblings of computer science graduates. The profiles of 
each text corresponded closelv to my subjective impressions. (This is not surprising 
since the values of the dictionary were also the product of my subjective decisions. In 
this sense, The Stylist is an expert svstem that has automated my own rules and sense 
of style. The facility of allowing the dictionary to change, however, allows any user to 
adapt The Stylist to his own taste). 

I now could finish coding Reporter. Reporter now took the profile and 
manipulated its values to create an Analysis. This manipulation took into account the 
genre of the user’s input text and involved simple weighting factors rather than any 
complex statistical methods. Like the original values of the words of the dictionary, 
these weighting factors depended upon my own subjective impressions, but they also 
derived from the results of the test data set runs. If a technical paper had four times as 
many intangible words than tangible words, it read like so much mush. So I 
incorporated a ratio of 4:1 as the limit of intangible:tangible for nonfiction. I tried to 
be conservative in these weightings, because I always envisioned The Stylist as a 
descriptive rather than a prescriptive tool. The Stylist thus rebukes the user only when 


the qualities of his text fall far outside the bounds of the norm. 


B. FIELD TESTS 

The Stylist was finally ready for field testing. This final phase was meant to 
mimic the introduction of the software product for consumer use. The field testing 
took place in two parts. In the first, two computer science graduates were briefed on 
the program and asked to use it. In the second, 22 papers for an Administrative 


Science composition course were run though The Stylist. 
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1. Field Test One : The Ape Test 

The two computer science graduate students evaluated the program by 
unleashing it on some of their own writing. Their overall reaction was highly positive, 
possibly because The Stylist seemed to approve of their writing. They praised the 
human factors and the graphics. Thev thought that the breakdowns of the values and 
the concordances were intriguing and offered them insights into their own writing. 
Thev complained that the report didn’t explain the meanings of some of the values. 
They also didn’t like the need to remove embedded commands from the input text. | 
addressed these complaints by adding some explanations to the report and by changing 
Reader so that it didn’t signal End_of_Sentence when it encountered an embedded 
command. 

2. Field Test Two : The Writing Class 

A Naval Postgraduate School Administrative Science composition class 
provided 22 short papers for testing. These students are almost all military officers 
who are considered by their services as top performers. As such, they constitute a 
somewhat literate and professional test group. Their fairly uniform papers were 
apparent attempts to incorporate some of the lessons of good business communication. 
[ ran their papers through The Stvlist, then provided the reports to the Professor, who 
Poncetiem tontne students. {| never met the students; neither did they ever use The 
Stvlist themselves. Of the 22 students, 12 filled out a questionnaire detailing their 
reactions to the reports. 

Most (10 out of 12) had never heard of a style checker before. Most (11 out 
of 12) understood the reports. One student thought The Stvlist was worthless: 11 
found some of its features helpful. Six students found the measures of word length and 
sentence length helpful. Nine found the measures of vigor, etymology and difficulty 
helpful. Six liked the concordance; four didn’t think it was useful. Interestingly, onlv 
five thought The Stvlist accurately reflected the qualities of their writing; three were 
sure it didn’t; three just weren't sure. One common complaint was that they weren't 
sure of the meanings of some of the categories. 

Overall, this test underscored the need for a user's manual. Appendix B is just 
such a manual. If this manual had been available for the students or if they had had 
more experience using The Stylist, then they would have better understood the 
meaning of the reports. Despite this drawback, it was obvious from the comments in 


the questionnaire that the majority of the students liked The Stylist and would want to 
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The Stylist would be able to find a place in the classroom and possit 


market. 
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VI. CONCLUSIONS 


Researching, designing, coding and testing The Stylist was an excellent academic 
exercise. It brought home many of the lessons of software engineering. It sharpened 
my skills. The final product, The Stylist, seems a success in that its affective modeling 
works. As such, The Stylist should serve as a contribution to the development of other 
stvle checkers, rather than a stand-alone style checker itself. As to whether writers, 
teachers and students should use computers to analvze writing, [’m reminded of the 


following passage from Plato’s “Phaedrus”: 


Socrates. 

At the Egyptian city of Naucratis, there was a famous old god, whose name 
was Theuth... his great discovery was the use of letters. Now 1s those days the 
god Thamus was the king of the whole country of Egypt... To him came 
Theuth and showed his inventions... when they came to letters, This, said 
Theuth, will make the Egvptians wiser and give them better memories; it is a 
specific both for the memory and for the wit. Thamus replied: O most ingenious 
Theuth, the parent or inventor of an art is not always the best judge of the utility 
or inutilitv of his own inventions to the users of them. And in this instance, vou 
who are the father of letters, from the paternal love of your own children have 
been led to attribute to them a quality which they cannot have; for this 
discoverv of yours will create forgetfulness in the learners’ souls, because thev 
will not use their memonies: thev will trust to the external written characters and 
not remember of themselves. The specific which you have discovered is not an 
aid to memory, but to reminiscence, and you give vour disciples not truth, but 
onlv the semblance of truth; they will be hearers of many things and will have 
learned nothing; they will appear to be omniscient and will generally know 
nothing; they will be tiresome company, having the show of wisdom without the 
reality. (Ref. 18] 


In the fourth century before Christ, wise Athenians were debating the uses and 
evils of literature itself. As Plato points out in the above passage toward the end of his 
dialogue, literature, the act of writing down our ideas, can have its pitfalls. It can 
weaken our memories. (Let anyone who has never forgotten where he parked his car 
argue this point.) It can also lead to intellectual cheating. The wisdom of a man 
capable of piecing together an article with the aid of a hbrary and a long weekend may 
indeed be less than the wisdom of a man who can stand before a learned crowd and 


speak an intelligent discourse. 
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Nevertheless, civilization has embarked on a course inseparable from literature 
and writing. Wealth, power and knowledge has followed the progress of the written 
word. Literature has made us strong, but to the extent that we rely upon books 
instead of our minds, we have grown weak. Now, at what we call the dawn of the 
information age, We would do well to remember the reservations of Plato. Electronic 
computation offers us dazzling abilities, but to the extent we rely upon it instead of our 
minds, we will grow weak. 

Therefore, any stvle checker or CACI program should be used as an interesting 
tool that provides a fresh perspective on writing. These programs cannot take and 


should not be put in the place of thoughtful readers, editors, teachers and friends. 


APPENDIX A 
SUGGESTED EXTENSIONS TO THE STYLIST 


1. ABBREVIATIONS 

Reader's Read _Intext procedure ignores embedded commands such as 
“embedded” by keeping count of the number of words read since the end of the last 
sentence. If only one word has been read when another period is encountered, 
Read_Intext does not signal end_of sentence. This simple feature also allows 
Read _Intext to ignore the ellipsis ("...”). It doesn’t always allow it to ignore 
abbreviations such as “Mr.”, “Dr.”, “etc.”, or “e.g.”. If these abbreviations appear as 
the first word of this sentence, Read_Intext doesn’t signal end_of sentence. If the 
abbreviation appears somewhere in the middle of the sentence, Read_Intext does signal 
end of sentence. An example is the following sentence : “You should have told Prof. 
Wu about this earlier.” Read Intext would signal end_of sentence at “Prof.” and 
“earlier.”. This is a shortfall of the Reader module, but one which I haven’‘t corrected 
for four reasons: first, such occurrences are relatively rare; second, breaking the 
sentences into two always favors the user; third, the length of sentence measure is not 
the thrust of The Stvlist; fourth, the solution would involve unwanted overhead in 
computation. 

Writer's Workbench solves this problem for at least 48 abbreviations. 
Presumablv, every time a period is encountered, Writer’s Workbench checks a 
dictionary of these 48 abbreviations. If the word preceding the period matches one of 
these abbreviations, then Writer’s Workbench does not count this as an end to a 


sentence. A similar solution could be implemented for The Stylist. 


2, TRANSLATION TO TURBO PASCAL 

The design of The Stylist kept in mind the working environment of a personal 
computer with 1.5 megabytes of main storage. A good follow-on project for The 
Stylist would be to translate the code from Waterloo Pascal to Turbo Pascal. Copies 


of this program could then be distributed and possibly even marketed. 


3. IMPROVEMENTS OF THE CONCORDANCE 
In addition to the main CST and the FourDix, a third CST could be added. This 


new CST’s node’s key values would be characters of punctuation. During the 
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execution of Inorder_ Concordance, the procedure that creates the concordance, the 
traversals “lastward” and “nextward” could terminate upon encountering a character of 
punctuation in this third CST. This would mean the lines of the concordance would 
only contain phrases extracted from single sentences. This may or may not be a 
desirable feature. I myself found it useful to read entire phrases, even when they 
overlapped into preceding or succeeding sentences. 

\Mfore sophisticated concordances include at the end of each line a note as to 
Where this line can be found. The third CST outlined above could contains as a field in 
Its main node just such a note, with the name of the user’s input text, the page number 
and the line number. Such an extension would be valuable for scholarly research. For 


the short, single input texts, such notes are not necessary. 


4. PARTS OF SPEECH 

A considerably more ambitious extension would be to provide The Stylist the 
facility of determining the parts of speech. Other programs [Ref. 8] [Ref. 9] [Ref. 10] 
solving this problem use thousands of rules. The disambiguation of parts of speech 
would raise The Stylist to a whole new order of complexity. It would also allow more 
fine analysis of the characteristics of the words. For example, “like” as a verb has 
much different connotations than “lke” as a preposition. This facility would also 
allow other stylistic measures to figure into the profiles, such as, verb-adjective ratio 


and percentage of modifiers. 


>: PASSIVE VOICE 

The Researcher module could be extended to detect the use of passive voice. 
Ideally, this would be accomplished in the context of the extension involving parts of 
speech, so that active verbs would also be detected. A simpler solution is readily 
available, however. Researcher could raise a flag every time it receives a form of the 
verb, “to be”: that is, “is”, “are”, “am”, “was”, “Were”, “being”, and “been”. This flag 
would remain raised for the next three words. If any of these three words ended in 
“-ed”, then Researcher would count this as an example of the use of the passive voice. 
Again, I haven’t implemented this solution because it is not central to the concept of 


affective modeling and because I didn’t want to pay for the extra computation. 


6. CORRECTION OF POOR DICTION AND GRAMMAR 
Another ambitious extension of The Stylist would be to provide it with the 


facilitv of correcting poor diction and grammar. Both RightWriter and DICTION of 
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Writer's Workbench contain dictionaries of commonly used cliches or repetitive 
redundancies. Some examples from DICTION are: “a great deal of’, “in regards to”, 
“make adjustments to” and so on. One solution for The Stylist would be to include a 
new field in the dictionary. Words like “deal”, “regards” and “adjustments” would 
contain in this field a pointer to a table containing the poor phrases and their 
replacements. Such a solution would also require Researcher to maintain a phrase of 
the three or four most recently received words, so that it could verify that “regards” 
appeared in the offending context of “in regards to”. 

Grammatical corrections would require an even more ambitious extension. To 
perform properly, such a facility would have to be an extension of the parts of speech 
extension. Determining agreement between subject and verb, for example, would 
require first the identification of the subject and the verb. Frankly, it would be easier 
to incorporate the affective modeling of The Stylist into a more extensive program such 
as Writer's Workbench than it would be to extend The Stylist this far. 


APPENDIX B 
USER’S MANUAL 


lL. INTRODUCTION 

The Stylist is a Waterloo Pascal program that analyzes the stvle of English prose, 
both fiction and nonfiction. The Stylist package contains the following : this User’s 
manual, and the following electronic files : the Waterloo Pascal code of The Stylist 
itself, the text file dictionary of The Stylist, and two Waterloo Pascal programs for 
helping to maintain the dictionary, DIXSPLIT and DIXJOIN. 

The fundamental idea of The Stylist is that individual words have power. The 
denotation of a word is its meaning. The connotation of a word is its emotional 
impact. For example. “pupil”, “student”,and “scholar” all denote a person who studies 
and learns. The emotional connotations, however, range from the humble “pupil” 
through the familar “student” to the lofty “scholar”. Besides these emotional 
connotations, words have other intrinsic values which can be quantified. The Stylist 
concentrates on the following values: Etvmologv, Tangibilitvy, Difficulty, Emotional 
Connotation and Vigor. 

a. Etymology 

One of the beauties of the English language is that its vocabulary embraces 
two main sources : native and borrowed. Our native words mainly come to us from 
the Anglo-Saxon tongues. Borrowed words come from Latin, mostly by wav of the 


French of the Norman conquest. See Table 7, which ulustrates the difference. 











TABLE 7 
EXAMPLES OF ET Gre LOG 


Native Words Borrowed Words 





Man Peis ou 

Banger Daeres 

Thinker Philosopher 
Eare Conflagration 
Book Volume 


Dirtbag Miscreant 
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Native words tend to be short, strong and rough. Borrowed words tend to be 
long, mild and elegant. Good English prose, particularly good fiction, tends to use 
native words. Poor English prose, particularly bad technical wniting, exhibits a 
tendency to overutilize latinate etvmology. By counting up the number of native and 
borrowed words in your input text, The Stylist sees how it compares to good writing of 
the appropriate genre. 

b. Tangibility 

A word can either evoke an image of a thing or it can refer to an idea. The 

former words are “tangible” and the latter are “intangible”. Table 8 illustrates the 


difference. 


TABLE 8 
EXAMPLES OF TANGIBILTY 


TANGIBLE INTANGIBLE 
Rock Ethereal 
nce Automatically 
Beehive Then 
Sorvecce Bee 
Hammer Preliminary 


Tangible words are concrete, exact and evocative. Intangible words are ideal, 
general and cognitive. All writing uses both. Good writing, however, usually takes 
advantage of tangible words. Even the most philosophical writing benefits from the 
use of tangible words. Good fiction rarely strays too far toward the intangible. The 
Stvlist counts the tangible and intangible words of your input text and compare this 
count to good writing of the appropriate genre. 

c. Difficulty 

This is one of the most indisputable characteristics of words. The Stvlist 
defines four levels of difficulty : Elementary, High School, Graduate and Postgraduate. 
The words of each level are those most likely to be used in speech with ease by an 
average person of that educational level. Table 9 illustrates the categories. 

By counting the occurrences of difficult words, The Stylist could determine the 


overall difficulty of the vocabulary and thus the readibility of your text. 
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TAI. 
EXAMPLES OF DIF T terns 


ELEMENTARY hich sence GRADUATE POSTGRADUATE 


Big Tardy Matrix | Execrable 
Sister Rendezvous Immaterial Parsimony 
Sooo Transmission Tonaliey Recursive 
Process Lnductioenm Dilatory 
Done Joyous Procesces aygote 
Handsome Undergo Linear Synergy 





| 
| Ey aig Foreman Universal Homomorph 
Wish 


d. Emotional Connotations 


The Stylist defines five categories of emotional connotation : Sublime, 


Pleasant, Neutral, Unpleasant and Horrid. Table 10 illustrates the categories. 


| 
| TABLE 10 
EXAMPLES OF EMOTIONAL CONNOTATION 


SUBLIME PLEASANT NEUTRAL UNPLEASANT HORR ES 


Beauty Happy The Damage Cancer 
Sunrise Food Which Loss | Murder 
Vie CORy Friendly Brier Pasi hcl hc Whore 
Love Warm Is Loser Fucks 
God _ Helpful Name Cesc Death 
Faradise Sex When Wound [ral Gen 


By counting the use of these types of words, The Stylist could determine the overall 
emotional tone of vour passage. It could also detect flat or highly emotional writing. 
e. Vigor 

A related but distinct characteristic is vigor. Sublime and horrid words tend to 
be highly vigorous, but not all highly vigorous words are emotional. Examples of 
vigorous but unemotional words are “sprint”, “rush” and “cross”. This category also 
tends to be more objective. For example, the word, “soldier” can have widely different 
connotations for different people. The word “soldier” would please a career Army 
officer but it would displease a survivor of the Japanese occupation of Canton. Both 


could agree that “solider” is a vigorous word. 
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WABEE 1 
EXAMPLES OF VIGOR 


VIOLENT ENERGETIC CALM INERT 


Destroy Saaeelaice Read 

Creation Dive. , Nia 
Dag Discipline Manager something 
Atomic Wedding O Comma 
Momecals © steam Paper 
Conqueror Biyang sought Format 





By calculating the vigor the words of your input text, The Stylist would be 


able to estimate its overall strength. 


2. USING THE STYLIST 
The first step to using the Stylist is to save your piece of writing as “Input text 
a”. You should be aware of the following principles: 


1. Your input text should be at least 500 words long. The Stylist will examine 
shorter texts, but the statistical sample of shorter texts 1s too small for valid 
analvsis. 


2. Your input text should not be longer than 500 sentences (about 10,000 words). 
Such long texts require so much computation that most operating svstems such 
as MVS will terminate execution before completion. 


3. The Stylist 1s not meant to analyze non-prose constructions such as tables, lists, 
references and bibliographies. You should eliminate these from your input text. 


In the multiprocessing environment, DEFINE MAIN STORAGE at 1500k. The 
VM/CMS command ts “Define storage 1500k” followed by “I CMS”. 

Once vour input text 1s properly stored, execute The Stylist. DO NOT interrupt 
The Stylist during compilation or execution by hitting any kevs. Always wait for The 
Stylist to display a screen with a prompt. After The Stylist is compiled, you will see 
just such a series of screens that will ask you for information. The following 
discussions will help you decide how to answer. 

a. Name of Intext 

The first screen will ask you for the name of your input text. Simply tvpe in 

the name. You can use any characters. The maximum length of the name is 40 


characters. Hit “enter”. 
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b. Fiction or NonFiction ; 
The screen will automatically clear. A new screen will ask vou if your input 
text is fiction or nonfiction. Enter “I” or “2°. If vou enter any other charactemmumae | 
Stvlist will merely ask vou again. 
c. Report Frequency 
The next screen asks whether you want reports “frequently”, “seldomly” or 
“when done”. A report is a screen which The Stylist displays to you during execution, 
telling vou what it’s doing, how much it’s done and how much remains to be done. 
The act itself of producing such takes time : the more frequent the reports, the longer it 
takes. You should probably enter “2” for “Seldomly”. Most users find this the most 
agreeable. 
d. Concordance 
The next screen asks whether or not you want a concordance. A concordance 
is an alphabetical listing of all the significant words of your input text. (“Significant” 
words are all words other than functional words such as “the”, “and”, “as”, and so on.) 
Each line of the concordance contains the key word and the phrases preceding and 


succeeding it. The following is an example of a concordance. 


them and thamus enquired ABOUT their several uses and 
discovered is notan AID to memory but to 

other egvptians might be ALLOWED to have the benefit 

an artisnot ALWAYS the best judge of 

is called by them AMMON to him came theuth 

learned nothing they will APPEAR to be omniscient and 
censured others ashe APPROVED or disapproved of them 
many arts suchas ARITHMETIC and calculation and geometry 
or inventor ofan ART 1s not always the 

the inventor of many ARTS _ such as arithmetic and 

olame of the various ARTS. but when they came 

calculation and geometry and ASTRONOMY and draughts and dice 
have been ledto ATTRIBUTE to them a quality 

in the learners’ souls BECAUSE  thev will not use 


+our own Children have BEE N@ led to atimobute: 76 


Most users find that a concordance offers them interesting insights into their 
use of words. The only drawback of producing a concordance is that it takes The 
Stvlist additional time. If you elect this option, your concordance will be printed to a 
separate file called, “Concrdnec text a’. 

e. Number of Words in Intext 

The next screen asks vou for the number of words 1n your text. If you don't 
know, make any guess. This number is only used in the screens displaying how far 
along The Stvlist is in reading the text. Your guess will have no effect on the analysis 
of the text. The Stylist counts the words itself and will report the exact number to vou 
when it finishes. 

f. Expand the Dictionary 

The next screen will ask whether you want to expand the dictionary of The 
Stvlist or not. This dictionary is a listing of about 3200 of the most commonly used 
words, together with five values for each word. If you answer “1” for “Yes”, the Stvlist 
will ask you to give it a values for every word in you input text which 1s not found in 
the dictionary. It will then add these new words and their values to the dictionary. If 
you answer “2” for “No”, then The Stylist will write all of the unknown words to a file 
called “Newwords text a”. Reviewing “Newwords” will give vou an idea of the words 
that you use which aren't in the dictionary. 

The first few times that you use The Stylist, you probably should answer “2” 
for “No”. Entering the values for the new words can be tedious. You probably want 
to see how The Stvlist works before you begin to expand or modify the dictionary. 
After vou've used The Stvlist for a while, you'll probably want to begin adding some 
words to the dictionary. Words peculiar to your field of writing or the jargon of your 
profession are examples of the types of words which should be entered. Adding them 


will tailor The Stylist to your working environment. Keep the following in mind: 


*** Every word added to the dictionary is another word 
that will have to be loaded into memory during each 
execution of The Stylist. Adding many words will slow 
down execution and it will decrease the amount of space 
in main storage. If the dictionary grows beyond 5000 
words, you should weed out words that are rarely used 
in your type of writing. To do this, simply delete 


iauiGcinom tae ile DMinonary texta . *** 
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If you answer “1” for “Yes”, as the Stylist encounters each new word in vour 
text, it will ask if vou want to add this particular word to the dictionary. If the word is 
a misspelling or if it is a Word that you rarely use, don’t add it. Also, you should know 
that its best to add the root form of words. For example, neither “burns” nor “burned” 
nor “burning” should be added. It’s much better to add “burn”. To keep the 
dictionary as compact as possible, note down the root form of words you wish to add 
to the dictionarv and append them to the end of the next text you run through The 
Stvlist. 

The Stylist will then ask you for the values of the words you do want to add 
to the dictionary. The following will help you decide how to answer these prompts : 

I. Source 

If vou’re uncertain about the etymology of the word, simply look it up ina 
collegiate dictionary. Old English, Middle English, Norse, German, Old German, 
Middle German and all Celtic languages are considered “Germanic”. Latin, French, 
Spanish, and all other languages are considered “Latin”. 

2. Difficulty 

Each category contains those words which are most likely to be used in 
speech by persons of that educational level. If you're still uncertain of the meaning of 


these categories, the following are examples of the categories : 
POSTGRADUATE DEH FICE Rin 


aggregate 
algorithm 
algorithmuc 
ambiguity 
analogue 
applicative 
conceptualize 
conglomerate 
constriction 
consummate 
consummation 
convoluted 


convolution 


4-1 


culminate 
cyclical 


digitize 


GRADUATE DIFFICULTY 


accumulate 
acrid 
acute 
adhere 
adjoin 
allocate 
allude 
analogy 
analysis 
analvst 
ancestor 
ancestral 
annihilate 
apparatus 
appendage 
arc 

arch 


array 


fled SCHOOL DIFFICULTY 


abandon 
ability 
able 
abrupt 
absence 
absent 
absorb 
acceptabie 


access 
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accommodate 
accomplish 
accomplishment 
account 
accurate 
achievement 
activate 

adapt 
additional 
adjust 


advantage 


ELEMENTARY DIPRICwWEiy 


about 
above 
accept 
acceptance 
ache 
across 
act 
action 
active 
actiVIty 
actor 

add 
addition 
address 
admit 
advance 
adventure 
afraid 
after 
afternoon 


again 
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3. Concreteness 
Does this word name a thing that you can touch? If it does, then answer 
“\” for “Tangible”. Otherwise, answer “2”. 
4. Emotional Connotation 
What feelings does this word evoke in you? If none, answer “3” for neutral. 
Otherwise, answer in the appropriate category. 
5. Vigor 
Vigorous words are not necessarily verbs. Words like “conquest” are highly 
vigorous. “Violent” vigor means “highly” vigorous. Some of the words in the “violent” 
category are “creation”, “triumph” and “epiphany”. This category, as does emotional 
connotation, calls for your subjective reaction. Don’t be afraid to “guess” which value 
1s appropriate. Such a “guess” is in fact your subjective reaction. 
g. The End of Execution 
The above are all the screens which prompt you for answers. All the rest of 
the screens will merely inform vou about the progress of the execution. When 
execution ends, you should look into the following files : “Newwords text a”, if you 
@eciaed not to let the dictionary expand; “Concrdnc text a”, if vou elected to have a 
concordance produced; and most importantly, “Report text a”, which is The Stvlist’s 


report on your text. 


3. THE MEANING OF YOUR REPORT 
a. Measures of Length 

The first two pages of your report deals with the number of words. the length 
of words and the length of sentences. These measures are always interesting but not 
alwavs uluminating. The first bar graph shows the distribution of the length of the 
words of your text. Each horizontal column represents the percentage of the words of 
your next which were of so many letters of length. The overall bar graph should look 
hike a bell-shaped curve. This bar graph does not include function words such as “a”, 
“an”, “the”, “then” and “or”. The blurb below the bar graph explains how vour text 
compares to other texts of the same genre. The only clear indication of prose in 
trouble would be a dramatic slewing of the curve to the right. This would mean that 
you're using too many windy words. Table 12 breaks down the characterizations of 


word length. 
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TABLE 12 
WORD TENG Ta 


Average Word Length 
lower boundary) 


Gnaracterizaewven Fiction Noniicerem 
Tee Lond 6 i) 
Long 5.2 ee 5 
Medium & 5a 
Snore & 5 
Too) Snore OQ O 


The second bar graph shows the length of the sentences of your input text. 
Each bar represents the length of a sentence. You can use this graph to identify run- 
on sentences. Overall, this bar graph should resemble the skvline of a modest city such 
as San Francisco. If it resembles New York, then you tend to write long sentences. 


Table 13 explains the characterizations of sentence length. 


TABEE AS 
SENTENCE LENGE 


| Characterization Lower boundary of length 
| Toe Long Ze 
| Lon 18 

Medium ee) 

Shere 10 

VWeCmsieCre 0 


One of the measures below this bar graph is “Number of sweet spots”. This 1s 
the number of sentences ranging between 9 and 19 words. The idea here is that most 
ideas can find a home in a sentence of this length. Table 14 breaks down how The 
Stylist characterizes the modulation of your sentences. 

Another count here is the number of run-on sentences. The Stylist defines 
anv sentence over 45 words in length as arun-on. The Stylist realizes that many grand 
sentences over 45 words in length are not run-ons, but it’s willing to bet that most 


overlong sentences are. 
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TABLE 14 


MODULATION 
Percentage Characterization 
Above 50% Good 
ae xe 50% Average 
Tow 20° Poor 


b. Measures of Word Characteristics 
I. Etymology 
The next item in the report is the number of Latinate and Germanic words. 
Notice that these two numbers will not add up to the total number of words in the text 
unless every word in the text happened to belong to the dictionary. The 
characterization of your text in this category depends on the ratio of latin to germanic 
words. Good writing generally tends to be native. Table 15 breaks down these 


characterizations. 


TASLE Is 
CHARACTERIZATIONS OF ETYMOLOGY 


Ratio of Latin to German 
Characterization PLetien NonFiction 
Very Borrowed Zia So 
Borrowed Ss: : 
Mixed 
Native | 
Very Native 





2. Difficulty 
The next entry shows the tally of the numbers of words of the various 
levels of difficulty, followed by the percentage of those levels. The difficulty of the 
text's vocabulary should be appropriate for its intended audience. Moreover, you 
should remember than many outstanding works of literature convey difficult ideas 
Without resorting to difficult words. Table 16 describes the characterizations of this 


category. 
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TABLE 16 | 
CHARACTERIZATIONS OF DIFFICULTY ) 


Characterization Cause 


| 
E ante z 
| 


Very Hard Zeostgrad > O 
rad a 

Challenging Oraigneeneen = 10 

Easy above percentages 


do not pertain 


3. Tangibility . 
The next entries show the total of the tangible and intangible words. As in 
the etvmology categorv, the characterizations depend upon the ratio between these 


words and the genre. Generally, good writing is tangible. 


TABEES: 
CHARACTERIZATIONS OF TANGIBILITY 


| 
| 
Ratio of Intangible to Tangible 


Characterization Ele ero) Nonfiction 
Very tangible oleae 2 
Tangible Le Zo 

| Very intangible oe 4:1 


4. Emotional Connotation 
The next breakdown is of the number and percentages of the words 
belonging to the various categories of emotional connotation. Following these values 
1s something called the “Index of Emotion”. The following 1s the formula used to 


derive this index : 


Index = (Percent Horrid times 5) + 
(Percent Unpleasant times 2) + 
(Percent Pleasant times 2) + 


(Percent Sublime times 5) 


The Index of Emotion is simply a number that conveys a sense of the use 
of strongly emotional words. The most emotional words, Horrid and Sublime, are 
weighted the most heavily. This Index allows The Stylist to compare your use of 
emotional words to the use found in other texts. Table 18 shows the thresholds of 


such uses. 





TABIEESKS 
CHARACTERIZATIONS OF EMOTIONALITY 


Characterization Lower Boundry of Index 

| Rich ZO | 
Average 10 
ror 0 





Following this characterization is another one, this time for the overall tone 
of your text. The Stylist compares the frequency of use of positive words to the 
frequency of use of negative words. If positive words prevail, the tone is positive. If 
negative words prevail, the tone is negative. If the two balance and the index of 
emotion is high, then the tone is characterized as a balance of strong positive and 
strong negative emotions. If the two balance and the index of emotion is low, then the 
tone 1s characterized as bland. 

5. Vigor 

The final breakdown 1s that of the number and percentage of words of the 
various categories of vigor. Following this is an “Index of Strength’, which is a 
measure similar to the Index of Emotion. The following is the formula used to derive 


the Index of Strength : 


Index of Strength = (Percent of violent words times 10) + 
(Percent of energetic words times 5) + 
(Percent of calm words) 
This Index allows the Stvlist to compare your use of vigorous words to uses 
found in other texts. Table 19 explains how The Stylist uses the index to characterize 
your text. Most good writing is strong or lively. Usually only action-packed fiction 


reaches the upper registers of “very strong”. 
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TABLE 19 
CHARACTERIZATIONS OF VIGOR 


| Characterization Lower Boundry of Index 
| Veny ctEong 60 
| Sen oncd 50 
Lively Zo | 
| Weak 0 | 


6. Recommendations 

The final section of your report includes recommendations and 
commendations. The Stylist is meant to be a descriptive rather than a prescriptive 
tool. Therefore, it makes recommendations only when some aspect of your wniting 
seems Well outside of the bounds of the normal. For this reason, you should consider 
the recommendations of The Stylist. They apply only to what appear to be extreme 
cases. On the other hand, you should never accept these recommendations as the 
pronouncements of some oracle. The Stylist 1s merely a machine. When in doubt, 


trust Vour own instincts. 


4. MAINTENANCE OF THE STYLIST 
a. Care of the Dictionary 
Many of the issues regarding the care of the dictionary have already been 
covered. Here are some other helpful pointers : 


1. Always maintain more than one copy of the dictionary. Loss of the dictionary 
renders The Stylist useless. 


td 


Beware of changing the values directly. You should never do this to the 
dictionary itself. While using DIXSPLIT, it is necessary to go into the split 
files and change the codes for the words which don’t belong in that split file. If 
vou make a mistake, DIXJOIN will merely assign the most common value for 
that category. 


b. Changing the Code of The Stylist 
The Stylist comes with embedded documentation. Make sure that you read 
this documentation prior to changing any of its code. You should read the full 


master’s thesis pertaining to The Stylist prior to making any substantive changes.” 


>See Master's Thesis : The Stylist: A Pascal Program for Analyzing Prose Style, 
by Lt. Thomas C. Cool, USN, Naval Postgraduate School, Monterey, California, 1987. 
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Substantive changes should also bear in mind the modularization of the 
design. Reader module bears full responsibility for reading the user’s input text. It 
passes along words (in lower case) and their length. It also signals the ends of 
sentences and the end of the file. Researcher Module bears full responsibility for 
maintaining and using the dictionary, for building up the profile of the user's input 
text, and for building the concordance. Finally, Reporter module accepts the profile, 
creates the analysis and writes out the report. 

Some fine tuning of The Stylist is readily available. The global constants give 
vou the capability to adapt The Stvlist to vour operating environment. For example, 
“Maxsent” allows you to determine the maximum number of sentences in the input 
text. Moreover. by manipulating the weighting factors in the Calculate procedures of 
the Reporter module, you can adapt The Stvlist to the norms of a particular school of 
writing, such as submissions to magazines as varied as “Fantasy and Science Fiction” 


and “Foreign Affairs.” 


APPENDIX C 
THE CODE OF THESE 


(x NOTE : THE FOLLOWING IS NOT IN EXECUTABLE ORDER. 
IT IS IN THE ORDER OF GREATEST CONCEPTUAL CLARITY. X) 

(*$S60000%) 

(36 KK HEE IE IE IE HE HE HE HE HE HE HE HE HE HE HE EE EE HE HE HE HEHE HEE HE HE HE HE HE IE EE HE EE HE HE HE HE EEE EH HHH HHH HHH KK KKK ) 
(x *) 
(x THE STYLIST *) 
(x *) 
(x a Pascal Program for Analyzing Prose Style *) 
(x by Lieutenant Thomas C. Cool, USN *) 
Cx Submitted in partial fulfillment of the requirements 2 
Cx for the degree of * ) 
(x MASTER OF SCIENCE IN COMPUTER SCIENCE ¥) 
(x from the ¥ ) 
(x NAVAL POSTGRADUATE SCHOOL ¥*) 
(x June 1987 *) 
(x ¥*) 


(HE 3 HE HE HE HE IE KK HE FE HE HE HE HE HE HE HE SE DE FE HE HEE FEE HE HE HE IE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE IE IE HE HE HE HE EE HE HEE HEE HEE EE EE KKK) 


program Stylist Cinput, 


const 
maxsent = 500; 
wordlength = 15; 


linelensth = 40; 


output); 


(¥ limit of number of sentences in intext *) 


Phraselength = 4; (¥ length of phrase around condcordance linexX) 
freqnum = 50; (¥ the three numbers determine how often re) 
selnum = 200; (*% Stylist reports to user during execution: *®) 
whendonenum = 1000000; (*® frequently, seldomly or when done *) 
type 
(x GLOBAL TYPES *) 
Genretype = (Nonfiction, Fiction); 


Frequencytype = (Frequently, Seldomly, When_Done); 

Wordtype = packed array (.]..wordlength.) of char; 

Linetype = packed array (.1..linelength.) of char; 

User_Infotype = record 
Name_of_text linetype; 
Genre Genretype; 
Size Integer; (*% users guess of intext length *) 
Frequency Frequencytype; (¥ how often user wants reports *) 
Wlant_Concord boolean; 


Want_Dix_Grow boolean; 
end; 
(x READER MODULE TYPES x) 
Readtype = record (% created by Reader, passed to Researcher *) 
Word wordtype; 
Length integer; 
end; 


54 


(x 


RESEARCHER MODULE TYPES *) 


Balancetype = (Plus, Zero, Minus); (* Balance of Node in BST xX) 
Statustype = (Storable, Not_Storable, Not_Valuable); 
(¥ Storable - a full node with all values which ¥ ) 
(x 


can be added to the dictionary ¥ ) 
(¥ Not _Storable - a node of a variant (-ing, -ed) *) 
(x word which has values but should ¥ ) 
(x not be added to dictionary * ) 
(¥ Not_Valuable - a node of a new word without %) 
(x values. Cannot be Add_Valued and *) 
(x cannot be added to dictionary. ¥ ) 

Sourcetype = (Latinate, Germanic); 


Difficultytype = (PostGrad, Grad, High_School, Elementary); 
Concretenesstype = (Tangible, Intangible); 


Emotiontype = (Sublime, Pleasant, Neutral, Unpleasant, Horrid); 

Vigortype = (Violent, Energetic, Calm, Inert); 

(* tags which describe each word in the dictionary ¥) 

EntryPointer = -Entrytype; 

ConcordPointer = -=Concordtype; 

Entrytype = record (* main node in the BST X) 
Word : Wordtype; (* word is Key field ¥) 


Balance : Balancetype; 

Status : Statustype; 

Source : Sourcetype; 

Difficulty : Difficultytype; 
Concreteness : Concretenesstype; 
Emotion : Emotiontype; 

Vigor : Vigortype; 

Left, Right : EntryPointer; 


Down : Concordpointer; 

end; 

Concordtype = record (*¥ a Concord node is a cluster of pointers xX) 
Up : EntryPointer; (*¥ which create linked lists from the first*) 
Down, CX word of the Intext to the last, and from*) 
Next, (¥ the last to the first. Used only to ¥ ) 

Paco : ConcordPointer; (* create the concordance. ¥ ) 
end; 


a) 


LtrsPerWordtype = array (.0..wordlength.) of integer; 
(* array of total number of words of each length ¥ ) 


WordsPerSenttype = array (.0..maxsent.) of integer; 
(* array of the length of each sentence 


Profiletype = record (* the profile is the main product xX) 
(*X of the Researcher Module. Here *) 
(* Researcher keeps track of all ¥ ) 


(*¥ counts. ¥*) 
Totalwords : integer; €% Total number of words in intext *) 
Totalletters : integer; 
Totalsentences : integer; 
TotalStrucltrs : integer; CX Total of ltrs of structural wrds *) 
NumWlordsThisSent : integer; (* # words in sentence being read xX) 
Newwords : integer; (*% # words user adds to dixonary ¥*) 
KnownWords : integer; €% # words of intext found in dixon.*) 
VariantWords : integer; C€* # variant words of intxt in dixon*) 
Structurewords : integer; CX # stucture words of intxt in 4dixxX) 
UnknownWords : integer; Cx # intext words not found anywhere) 
Numfound : integer; CX # total words minus unknown words*) 


LtrsPerWord : ltrsperwordtype; 
WordsPerSent : wordspersenttype; 
Ave_ltrs_per_word : real; 
Ave_wrds_per_sent : real; 
NumLatinate : integer; (* rest of profile are counts of X) 
NumGermanic : integer; (*% words of the intext, found in X) 
NumPostGrad : integer; €% the dictionaries, which have xX) 
NumGrad : integer; (% these corresponding values ¥* ) 
NumHigh_School : integer; 
NumElementary : integer; 
NumTangible : integer; 
NumIntangible : integer; 
NumSublime : integer; 
NumPleasant : integer; 
NumNeutral : integer; 
NumUnpleasant : integer; 
NumHorrid : integer; 
NumViolent : integer; 
NumEnergetic : integer; 
NumCalm : integer; 
NumInert : integer; 
end; 


Phrasetype = packed array (.1..phraselength.) of wordtype; 
(*% holds the phrases bracketing the Key words of concordance line X) 
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(x REPORTER MODULE TYPES ¥ } 


Lengthtype = (TooShort, Short, Medium, Long, Toolong); 
Etymologytype = (TooBorrowed, Borrowed, Mixed, Native, TooNative); 
Modulationtype = (Bad, Average, Good); 

Runonstype = (Unacceptable, Acceptable, Nonexistant); 
Tangibilitytype = (Soft, firm, Solid); 

Strengthtype = (Weak, Lively, Strong, VeryStrong); 

HardWordstype = (Easy, Challenging, Hard, VeryHard); 
Emotionalitytype = (Poor, Standard, Rich); 

Tonetype = (Negative, Bland, Positive); 

(¥ Judgements of intext style Reporter makes based on profile *) 


Analysistype = record 
WordLength : Lengthtype; 
SentLength : Lengthtype; 
Modulation : Modulationtype; 
Runons : Runonstype; 
Etymology : Etymologytype; 
Tangibility : Tangibilitytype; 
Strength : Strengthtype; 
Hardwords : HardWordstype; 


Emotionality : Emotionalitytype; 
Tone : tonetype; 
end; 
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Cx GLOBAL VARIABLES *) 


var 
info : User. Infotype, 


Allswell : boolean; (%* global flag that execution can continue x) 
(* used instead of a GOTO end of program ¥* ) 


FourRoot, (X Root of the FourDix BST *) 

Root : EntryPointer; (X¥ Root of the Dixonary BST X) 

Lastward : ConcordPointer; CX Pointer to last word read X) 

Intext, (* file that holds users prose x) 
Dixonary, €* dictionary of words and values *) 
FourDix, (¥ diction. of short, functional words *) 
Coneradane, (* concordance, created by researcher *X) 
NewWNords, CX list of words not found or added *X) 
Report : text; (*¥ main product of Reporter xX) 

Tally, (* Researcher scratch pad of profile xX) 
Profile : profiletype; (* Researcher input to Reporter ¥ ) 


Analysis : analysistype; (€* Judgements made by reporter *X) 


CX MAIN PROGRAM ¥ ) 


begin (X main program xX) 

Interrogate_User; 

if Allswell then Init_Reader; 

if Allswell then Init_Researcher; 

if Allswell then Init_Reporter; 

if Allswell then Read_Intext; 

If Allswell then Calculate_Profile; 

If Allswell then Analyze_Profile_and_Report; 

If Allswell then if Info.Want_Concord then 

Write Concordance; 

If Allswell then if Info.Want_Dix_Grow then 
Store_New_Dix; 

If not Allswell then 
Writeln C'Execution terminated until you clear that up.'); 
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procedure Interrogate_User; 
var c: char; 
good_integer : boolean; 
1, J : integer; 
begin 
Page; 
Writeln; Writeln; Writeln; 
Writeln C'THE STYLIST!:49); Wraiteln; Writeln ('by LT TC Cool!:50); 
Writeln; Writeln; Writeln; Writeln; 
Writeln (C'What is the name of the text to be analyzed?!':65); 
mec= QO; 
while (i < aces and not CEOLN) do begin 
eo = 1 + 1; 
Read (Info.Name_of_Text(€.1.)); 
end; Readln; 
for 3 := 1 + 1 to linelength do Info.Name_of_Text(€.3.) := ' '; 
Page; 
Repeat 
Writeln; Writeln; Writeln; 
Write (€'Is ', Info.Name_of_Text); 
Writein (€' saved as "Intext text a™?');Writeln; 
Writeln C(! 1) Yes le 
Writeln C! 2) No 1:45); 
Read (cc); Readln; Writeln (¢:45); 
Metz. Cc= '1') or Ceo = '2'); 
if €c = '1l') then begin Writeln ('That is good.':45); 


Allswell := true; end 
else begin Writeln ('That is too bad.':43); 
Allswell := false; end; 


if Allswell then begin 
Repeat 
Page; Writeln; Writeln; Writeln; 
Writeln CInfo.Name_of_text:55); Writeln; 
Writeln C! is 1345); 
Writeln; Writeln; 
Writeln C'1) Fiction mG): 
Writeln €'2) Nonfiction':45); 
Writeln; Writeln; 
Read (cc); Readln;Writeln (c¢c:45); 


if €c = '1l') then Info.Genre := Fiction 

else if (c = '2') then Info.Genre := Nonfiction; 
omer) €e = *1") or Ce = '2'); 
Repeat 


Page; Writeln; Writeln; Writeln; 

Writeln C! REPORT ':45); Writeln; 

Writeln C€'1) Frequently! :45); 

Wraiteln €'2) Seldomly ':45); 

Writeln (€'3) When Done ':45); Writeln; 

Read (cc); Readln; Writeln; Writeln (c); Writeln; 

if €c = '1l') then Info.Frequency := Frequently 

else if (c '2') then Info.Frequency := Seldomly 

else if (Cc '3') then Info.Frequency When_Done; 
Until €c = '1'3 or (Cc = *2') or Ce = '3'); 


~- Ol ot 
| 


be 


Repeat 


Page; Writeln; Writeln; Writeln; 

Writeln €*' CONCORDANCE ':45); Writeln; 

Writeln C! 1) Yes ae) 

Writeln C! 2) No 1G 5e, 

Read (cc); Readln; Writeln (c¢:45); 

if €c = 'l') then Info.Want_Concord := true 

else if (c = '2') then Info.Want_Concord := false; 


Until Cc= *) or Cc —-— 2s 
Repeat 


Page; Writeln; Writeln; Writeln; 
Writeln CtApproximately how many words are in '); 
Writeln CInfo.Name_of_text, ' ?':3); 


Info.size := 0; 
Good_integer := true; 
while not EOLN do begin 
Read (c); 
if “Cée-an CoO" . oO eo aenen 
Info.size := Info.size * 10 + ordt(c3.eo,rdq ou 
else begin good_integer := false; 


Writeln €ec:2, ' 1s an invalid character. '); vence. 
end; Readln; 


Until Good Enceger; 
Writeln CInfo.size:45); 
Repeat 


Page; Writeln; Writeln; Writeln; 

Writeln C' EXPAND DICTIONARY ? ':465); Writeln; Writeln; 
Writeln (C! 1) Yes Goi, 

Writeln (! 2) No 45); 

Read (c); Readln; Writeln (c¢:45); 

1f €c = '1') then Info.Want_Dix_Grow := true 

else if (c = '2') then Info.Want_D1ix_Grow: =false; 


Until Ce =. hit veruGce-.. 2s 


end; 
end; 


(X if allswell xX) 
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procedure Init_Reader; 
begin 

Reset (Intext, 'Intext text a'); 
end; 


procedure Init_Researcher; 
var 
ae 
stepcount, 
threshhold, 
dixlength : integer; 
entry : entrytype; 
balanced : boolean; 
begin 
Reset (Dixonary, 'Dixonary text a'); 
Rewrite(Newwords, 'Newwords text a'); 
if Info.Want_Concord then Rewrite (€Conerdne, 'Conerdne text a'); 
if (Info.frequency <> When_Done) then begin 
Dixlength := 0; 
while not EOF (CDixonary) do begin 
Readln (Dixonary); Dixlength := dixlength + 1; 


end; 

Reset (Dixonary, 'Dixonary text a'); 
end; 
case Info.Frequency of 


Frequently : threshhold := freqnum; 
Seldomly : threshhold := selnum; 
When_Done : threshhold := WhenDonenum; 


end; (* cases *) 
New CRoot); 
Boada (2, Root-); 
Root-.left := nil; Root-.right := nil; 
Root-~.down := nil; 
Root-.balance := Zero; 
Root-.status := Storable; 
Lastward := nil; 
Stepcount := 0; 
while not EOF aaconsre do begin 
moad (2, Entry); 
Entry.balance := Zero; balanced := false; 
AVL_Insert CEntry, Root, balanced); 
if CInfo.frequency <> When_Done) then begin 
stepcount := stepcount + 1; 
if (stepcount mod threshhold = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 
Writeln C'Stylist is now loading its dictionary. ':50); 
Wraiteln; Writeln; 
Wrateln C! Last word loaded was Soa), 
Writeln Centry.word:40); Writeln; Writeln; 
Writeln C'Stylist has loaded ':30,stepcount:5,' entries. ")3 
Writeln C'out of a dictionary of':33,Dixlength:5,' words.'); 
Writeln; Writeln; 
Writeln (CStepcount*¥100 dav Daxlength:33, ' *% complete.'); 
end; (X if X) 
end; (*% if *) 
end; (€*® while not EOF xX) 
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with tally do begin 


totalletters := 0; 
totalwords := 0; 
totalsentences := 0; 
totalstrucltrs 1: =20; 

numwordsthissent := 0; 

for i := 0 to wordlength do ltrsperword(.i.) := 0; 
for i := 0 to maxsent do wordspersent(.1.) := 0; 
KnownWords := 0; 

NewWords := 0; 

Numfound := 0; 

VariantWords := 0; 

Structurewords := Q; 

UnknownWords := Q; 

NumLatinate 
NumGermanic ; 
NumPostGrad :=0; 
NumGrad := Q; 
NumHigh_School := 0; 
NumElementary := 0; 
NumTangible := 0; 
NumIintangible := 0; 
NumSublime := 0; 
NumPleasant := Q; 
NumNeutral := 0; 
NumUnpleasant := 0; 
NumHorrid := 0; 
NumViolent := Q; 
NumEnergetic := 
NumCalm := 0; 
NumInert := 0; 


e 
? 


end; (XX with tally do xX) 


end; 


Reset (FourDix, 'FourDix text a'); 
New CFourRoot); FourRoot-.right := nil; FourRoot-.left 
Load (1, FourRoot-); 
Attach (CFourRoot-, FourRoot); 
FourRoot-.balance := Zero; 
While not EOF (CFourDix) do begin 

Load (1, Entry); 

Entry.balance := Zero; 

balanced := false; 

AVL_Insert CEntry, FourRoot, balanced); 
end; 


62 


ni ie 


procedure Load (filenum : integer; var entry : entrytype); 
var 
sourceltr, difficultyltr, 
concretenessltr, emotionltr, 
vVigorltr : char; 
begin 
if (filenum = 1) then Readln (FourDix, Entry.word, 
sourceltr, difficultyltr, 
concretenessltr, emotionltr, 
vigorltr) 
else Readln (Dixonary, Entry.word,sourceltr, difficultyltr, 
concretenessltr, emotionltr, 
Vigorltr); 
with entry do begin 
if (sourceltr = '1') then source 
else source 
case air cultyl tre of 


Latinate 
Germanic; 


'p* = difficulty .:= Postarad; 
ig’ = diffreulty ~= Grad; 

oie aatrriculty := Haigh Sehnoon; 
Te’ . difficulty := Elementary; 


end; 
if €Cconcretenessltr= 't') then concreteness 
else concreteness 


Tangible 
Intangible; 


case BOs OD LCE of 


cs. emotion := Sublime; 
™p' : emotion := Pleasant; 
™n! : emotion := Neutral; 
Tu’ : emotion := Unpleasant; 
‘Rh’ : emotion := Horrid; 

end; 

case vigorltr of 
ty! vigor = Violent; 
Te’ : vigor := Energetic; 
cc. = Vigor := Calm; 
fo vigor == inert; 

end; 

end; (*® with entry do xX) 


end; 
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procedure AVL_Insert CEntry : entrytype; var p : Entrypointer; 
var balanced : boolean); 
var 
pl, p2 : entrypointer; 
begin 
if (p = nild then begin 
Attach CEntryseoe 
p-.balance := Zero; 
balanced := true; 
end 
else if CEntry.word < p-.word) then begin 
AVL_Insert CEntry, p-.left, balanced); 
If Balanced then (¥ left pointer has grown higher *X) 
case p-~.balance of 


Plus : begin p-.balance := Zero; balanced := false; end; 
Zero : p-.balance := Minus; 
Minus : begin (%® rebalance *¥) 
bl := p-. left; 
if (pl-.balance = Minus) then begin(€® single LL rotatx) 
p-.left := pl-.right<; 
pl-. right <:- 7p; 
p-.balance := Zero; 


p := pl; 
end (* if X®) 
else begin (* double LR rotation *) 
pe c= Clocriomet. 
ol-.right := p2-.left; 
p2e~. left <= pl; 
p-.left := p2-.right; 


pP2=. right = - 9; 
if (€p2-.balance = Minus) then p-.balance := Plus 
else p-~.balance := Zero; 
1f (€p2-.balance = Plus) then pl-.balance := Minus 
else pl-.balance := Zero; 
p := pd; 

end; (* else xX) 

p-.balance := Zero; balanced := false; 


end; (* case of Minus xX) 
end; (* of cases *¥) 
end (€% if Entry.word < p-.word *X) 
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else if € Entry.word > p-~.word) then begin 


end 


end; 


AVL_Insert CEntry, 


P=erighnt, 


balanced); 


if balanced then (® right pointer has grown higher xX) 


case p-~.balance of 


Minus begin p-~.balance := Zero; 
balanced := false; end; 
Zero p-.balance := Plus; 
Plus begin (%® rebalance *) 
bl es =sp= right; 
if (pl-.balance = Plus) then begin 
(¥ single RR X) 
P=.ridas := pl-.left; 
ple chert. := p; 
p-.balance := Zero; 
p := pl; 
end (X% if ¥®) 
else begin (X% double RL rotation X) 
pZ2 := pl-.left; 
pl-.left := p2-.right; 
ep2-.right := pl; 
P-_frightm:= pz2-. left; 
p2>..lefre: = p; 
if (€p2-.balance = Plus) then 
p-.balance := Minus 
else p-.balance := Zero; 
if (€p2-.balance = Minus) then 
pl-~. balance := Plus 
else pl-.balance := Zero; 
Pp :> p2; 
end; €*® double RL rotation xX) 
p-.balance := Zero; balanced := false; 
end; (%® case of balance = Plus X) 
end; (%® of cases *®) 


SrwOo if EMcry.worda > P-.word *) 
else balanced := 
(CX of procedure AVL_Insert *X) 


false; 


procedure Init_Reporter; 


begin 


Rewrite (CReport, 


Writeln 

Writeln 

Writeln 

Writeln 
end; 


"Report text a'); 
Writeln€Report) ; 
Info.Name_of_Text: 

CE Obie 45) ; 
Writeln(Report); 


(Report); 
(Report, 
(Report, 
(Report); 
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Writeln(Report); 
G5); 


procedure Read_Intext; 
var 
1s 
letternum, 
wordnum, 
threshhold, 
stepcount, 
offset : integer; 
¢ : char; 
n : readtype; 
begin 
letternum := Q; 
wordnum := Q; 
stepcount <== 0; 
case Info.Frequency of 
Frequently : threshhold 
Seldomly : threshhold 
Hhen_Done : threshhold 
end; (* cases X) 
offset := ord (C'tA") = ord (a, 
while not EOF CIntext) do begin 
while not EOLN CIntext) and Allswell do begin 
Read CIntext, c); 
if Cc in €.'A".."'Z'.)) them ¢ <= chrCordCc))— otpse. . 
1f (Ce an (.'at 2.82" . 2) ore Ce = Sticeea > 
or Ce in ©. °O*". .'9' .))) then Bbeagrn 


freqnum; 
selnum; 
WhenDonenunm; 


letternum := letternum + 1; 

if Cletternum <= wordlength) then n.word(.letternum.) := c; 
end; 
if Cletternum > 0) then begin 

it ace Y T) torn “Ce '=-') or EOLN CIntext) or 


Cc Jor ce '?!) or Ce %=.* 9 of te] 'ecnen 
begin 

for i := letternum + 1 to wordlength do n.word(.1.):=' 

if Cletternum > wordlength) then n.length := wordlength 

else n.length := letternum; 

tore_Intext (nd); 

Wordnum := Wordnum + 1; 

letternum := 0; 

if €c = '.') or Ce = *2') or Ce = '!") of (Ce -—3 eee 


1f Cwordnum > 1) then begin (X not embedded command *X) 
Signal_End_of_Sentence; 
wordnum := Q; 

end; 

stepcount := stepcount + 1; 

if (€stepcount mod threshhold = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 


Write ('Stylist is now reading : '); 
Wrateln CInfo.Name_of_text); 
Writeln C'Last word read was ", A. word); 


Writeln; Wraiteln; 
Write C'Stylist has read ', stepcount: 5); 
Weitelm ©" words. “);Writeln; 
Writeln; Writeln; 
Writeln (StepcountXl00 div Info.Size:3, * % complete. '); 
end; (* if xX) 
end; 
end; (% letternum > 0 X) 
end; Readin (CIntext); 
end; ©* while mot EGF. ~*~) 
Signal_End_of_File; 
end; 
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procedure Store_Intext (n : readtype); 
var cp : concordpointer; 

found : boolean; 

entry : entrytype; 


begin 
tally.totalwords := tally.totalwords + 1; 
tally.totalletters := tally.totalletters + n. length; 
tally.numwordsthissent := tally.numwordsthissent + 1; 
found := false; 


if (n.length < 5) then Look_Up_Four (n, FourRoot, found); 
if not found then 


tally.LtrsPerWord(.n.length.) := tally.LtrsPerWord(.n.length.) + 
if not found then Look_Up (n, Root); 
end; 


procedure Signal_End_of_Sentence; 
begin 
tally.totalsentences := tally.totalsentences + 1; 
if (tally.totalsentences > maxsent) then begin 
Writeln C'INTEXT TOO BIG.'); Allswell := false; 
end 
else tally.WordsPerSent(.tally.totalsentences.) := 
tally.numwordsthissent; 
tally.numwordstnissent := Q; 
end; 


procedure Signal_End_of_File; 


begin 
if Info.Want_Concord then Lastward-.next := nil; 
tally.numfound := tally.totalwords - tally.unknownwords; 
Page; Writeln; Writeln; Writeln; Writeln; 
Write ('Stylist has finished reading : '); 
Writeln CInfo.Name_of_text); 
end; 
procedure Look_Up_Four (n : readtype; var ancestor : entrypointer; 
var found : boolean); 
var cp : concordpointer; 
entry : entrytype; 
begin 
if Cancestor = nil) then found := false 
else if Cancestor-.word = n.word) then begin 
found := true; 
1f Cancestor-.vigor <> inert) then Add_Values (ancestor); 
tally.totalstrucltrs := tally.totalstrucltrs + n.length; 
tally.structurewords := tally.structurewords + 1; 


if Info.Want_Concord then 
Attach_Concord_Node (ancestor, cp); 
ame «x if X) 
else if (ancestor-.word > n.word) then 
Look_Up_Four (n, ancestor-.left, found) 
else Look_Up_Four (n, ancestor-.right, found); 
end; 


ov 


1; 


procedure Look_Up (n : readtype; var ancestor : entrypointer); 
var cp : concordpointer; 
entry : entrytype; 
p : entrypointer; 
found : boolean; 
ce «= enar; 
begin 
if €ancestor = nil) then begin 
Look_Up_Variants (n, p, found); 
if found then begin 
tally.variantwords := tally.variantwords + 1; 
New Cancestor); 
with ancestor- do begin 


word := n.word; 
left <= nil; rienel: = onal; 
down := nil; 


end; (* with ancestor do *X) 
1f (p-~.status <> not_valuable) then begin 
Add_Values (p); 
with ancestor- do begin 
status := not_storable; 
source := p-~.source; 
difficulty := p-.difficulty; 
concreteness := p=.concreteness; 
emotion := p-.emotion; 
vigor := p-.vigor; 
end; (*® with ancestor do xX) 
end else (*% p-.status is other than not_valuable X) 
ancestor-.status := not_valuable; 
end 
else if Info.Want_Dix_Grow then begin 
Repeat 
Page; Writeln; Writeln; Writeln; 
Writeln (n.word: 50); Writeln; Writeln; 
Writeln C'ADD IT ?7':45); Writeln; Writeln; 
Writeln (€'1) Yes!'!:45); 
Vriteln €'2) No ':45); 
Read (c); Readln; 
Until Cé2=—1") or Cen 2. 
af Ce =" *)")ethen began 
Entry.word := n.word; 
Get_Values_for_New_Entry (CEntry); 
Attach CEntry, ancestor); 
Tally.newwords := tally.newwords + 1; 
Add_Values (ancestor); 
end else Attach_No_Value (Cancestor, n); 
end 
else Attach_No_Value Cancestor, n); 
if Info.Want_Concord then 
Attach_Concord_Node (Cancestor, cp); 
end (X® in ancestor = nil *X) 
else if Cancestor-.word = n.word) then begin 
1f Cancestor-.status <> not_storable) then 
tally.knownwords := tally.Knownwords + 1 
else tally.unknownwords := tally.unknownwords + 1; 
if C€ancestor-.status <> Not_Valuable) then 
Add_Values (Cancestonr); 
1f Info.Want_Concord then 
Attach_Concord_Node (ancestor, cp); 
ema -CXv1t <2 
else if Cancestor-.word > n.word) then 
Look_Up (n, ancestor-~.left) 
else Look_Up (n, ancestor-.right); 
end; 
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procedure Attach (Entry : entrytype; var p : entrypointer); 
begin 


New (Cp); 

with p- do begin 
word := entry.word; 
status := storable; 
source := entry.source; 
difficulty := entry.difficulty; 
concreteness := entry.concreteness; 
emotion := entry.emotion; 
vigor := entry.vigor; 
Herts: = Alls eight °= nil; 
down := nil; 

end; 

end; 


procedure Attach_Concord_Node (p : entrypointer; 


var cp : concordpointer); 
var Bottom, 
Bottommost : concordpointer; 
begin 
Bottom := p-.down; 
while (Bottom <> nil) do begin 
Bottommost := bottom; 
Bottom := Bottom-.down; 
end; 
New (cp); 
With cp- do begin 
Up := p; 
if (Lastward = nil) then Last := nil 
else last := Lastward; 
Down := nil; 
end; 
if (€P-.down = nil) then p-.down := cp 
else Bottommost-~.down := cp; 
if CLastward <> nil) then 
cp~.last-.next := cp; 
lastward := cp; 
end; 
procedure Attach_No_Value (var ancestor : entrypointer; 
n : readtype) ; 
begin 
Writeln Cnewwords, n.word); 
New Cancestor); Ancestor-.word := n.word; 
tally.unknownwords := tally.unknownwords + 1; 
Ancestor-.right := nil; Ancestor-.left := nil; 
Ancestor-7.down := nil; 
Ancestor-.status := not_valuable; 
end; 
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procedure Look_Up_Variants Coriginal : readtype; var p : entrypointer; 
var found : boolean); 


var variant : readtype; 
1 : integer; 
begin 
found := false; 
if Coriginal.length > 3) then 
if Coriginal.word(.original.length.) = 's') then begin 
Variant.word := original .word; 
variant.word(.original.length.) := * §; 
variant.length := original.length - 1; 
Look_Up_Variant (variant, Root, found, p); 
if not found and (variant.word(.variant.length.) = 'e') then 
begin 
variant.word(.variant.length.) := ' '; 
variant.length := variant.length - 1; 
Look_Up_ Variant (variant, Root, found, p); 
if not found and (variant.word(.variant.length.) = 'i') then 
begin 
variant.word(.variant.length.) := 'ty'; 
Look_Up_Variant (variant, Root, found, p); 
end; 
end; 
end 
else if Coriginal.length > 4) then 
if Coriginal.word(.original.length.) = ') and 
Coriginal .word(€.original.length - on = 'n') and 
Coriginal.word(.original.length - 2.) = nyt ) then begin 
Variant.word := original.word; 
for 1 := original.length - ¢ to original.length do 
Variant.word(.1.) := ! 
Variant.length := original. nee = Ss; 


Look_Up_Variant (variant,Root, found, p); 
1f not found then begin 
variant.length := variant.length + 1; 
variant.word(.variant.length.) := ‘'e'; 
Look_Up_Variant (variant, Root, found, p); 
if not found and 
Cvariant.word(.variant.length - 2.) 
variant.word(.variant.length - 1.)) aver begin 
variant.length := variant.length - 2; 
~-= Ff fe 


Variant.word(.variant.length + 1.) ; 
variant.word(.variant.length + 2.) : mers 
Look_Up_Variant (variant, Root, found, p); 
end; 
end; 
end 
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else if Coriginal.length > 4) then 


if Coriginal .word(€.original.length.) = 'd') and 
Coriginal .word(.original.length - 1.) = ‘'e't) then begin 
variant.word >= original .word; 
for 1 := original.length - i to original.length do 
variant.word(.i.) := ! 
variant.length := original. eee ee 5 


Look_Up_Variant (variant,Root, found, p); 
if not found then 
if €variant.word(.variant.length.) = '1') then begin 


variant.word(.variant.length.) := ‘y'; 
variant.length := variant.length - 1; 
Look_Up_Variant (variant,Root, found, p); 
end; 
1f not found then begin 
variant.length := variant.length + 1; 
variant.word(.variant.length.) := ‘te; 


Look_Up_Variant (variant, Root, found, p); 
1f not found and 
(variant.word(.variant.length - 2.) 
variant.word( .variant.length - 1.)) then begin 
Variant.length := variant.length - 
variant.word(.variant.length + 1.) := 
Variant.word(.variant.length + 2.) := 
Look_Up_Variant (variant, Root, found, p); 


, 
" ff, 
4 

f ff. 
, 


end; 
end; 
end 
else if Coriginal.length > 4) then 
1f Coriginal.word(.original.length.) = 'y') and 
Coriginal.word(.original.length ~ 1.) = 'l') then begin 
variant.word := original .word; 
for 1 := original.length - i to original.length do 
Variant.word(.1.) := ! 
variant.length := original. feat ee 
Look _Up_Variant (variant,Root, found, p); 
1f not found then begin 
variant.length := variant.length + 2; 
Variant.word(.variant.length.) := ‘tet; 
variant.word(.variant.length - 1.) := 'l'; 
Look_Up_Variant (variant, Root, found, p); 
1f not found and 
Cvariant.word(.variant.length - 2.) = fit) then begin 
variant.Jength := variant.length - 2; 
variant.word(.variant.length.) := ‘ty’; 
variant.word(.variant.length + 1.) := !' !; 
variant.word(.variant.length + 2.) := ' '; 
Look_Up_Variant (variant, Root, found, p); 
end; 
end; 
end; 
end; 
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procedure Look_Up_Variant (variant : readtype; 


var ancestor : Entrypointer; 
var Found : Boolean; 
var p: Entrypointer); 
begin 
if Cancestor = nil) then found := false 
else if Cancestor-.word = variant.word) then begin 
found := true; 
p := ancestor; 


end 
else if Cancestor-.word > variant.word) then 
Look_Up_Variant (variant, ancestor-.left, found, p) 
else Look_Up_Variant (variant, ancestor-.right, found, p) 
end; 


procedure Add_Values (p : entrypointer); 
begin 
if (@p-.source = Latinate) then 
Tally.NumLatinate := Tally.NumLatinate + 1 
else Tally.NumGermanic := Tally.NumGermanic + 1; 
Case p=. difficulty of 
PostGrad : Tally.NumPostGrad := Tally.NumPostGrad + 1; 
Grad >: Tally.NumGrad := Tally.NumGrad + 1; 
High_School : Tally.NumHigh_School := Tally.NumHigh_School + 1; 
Elementary : Tally.NumElementary := Tally.NumElementary + 1; 
end; 
1f (P-~.concreteness = Intangible) then 
Tally.NumIntangible := Tally.NumIntangible + 1 
else Tally.NumTangible >= Tally.NumTangible + 1; 
case p-.emotion of 
Sublime : Tally.NumSublime := Tally.NumSublime + 1; 
Pleasant : Tally.NumPleasant := Tally.NumPleasant + 1; 
Neutral : Tally.NumNeutral := Tally.NumNeutral + 1; 
Unpleasant : Tally.NumUnpleasant := Tally.NumUnpleasant + 1; 
Horrid : Tally.NumHorrid := Tally.NumHorrid + 1; 
end; 
case p-.vigor of 
Violent : Tally.Numviolent := tally.numviolent + 1; 
Energetic : Tally.NumEnergetic := Tally.NumEnergetic + 1; 
Calm : Tally.NumCalm := Tally.NumCalm + 1; 
Inert : Tally.NumInert := Tally.NumInert + 1; 
end; 
end; 
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procedure Get_Values_for_New_Entry (var Entry : entrytype); 
war ¢ : char; 
good_char : boolean; 
begin 
Repeat 
Page; 
Writeln; Writeln; Writeln; 
Writeln (C'SOURCE ':45); Writeln; Writeln; 
Writeln CEntry.word:45); 
Writeln; Writeln; 
Hevteln «°!) LATINATE *<:45); 
Writeln €'2) GERMANIC !':45); 
Read (c); Writeln (€¢:45); Readln; 
Ometl Cc = '1") or Ce = '2°*); 


if €c = 'l') then Entry.source := Latinate 
else Entry.source := Germanic; 
Repeat 
Page; 


Writeln; Writeln; Writeln; 
earelnec DIFFICULTY LEVEL ':45); 
Writeln; Writeln; 
Writeln CEntry.word:45); Writeln; Writeln; 
Writeln €'1) POSTGRADUATE!:45); 
Writeln €'2) GRADUATE 3:45); 
Writeln €'3) HIGH SCHOOL ':45); 
Writeln €'G) ELEMENTARY ':45); 
Read (€c); Writeln (€¢:45); Readln; 
Watil Ce = *1') or Cc = *2') or Ce = *3') or Ce = '4'); 
case c of 
Dltee Entry.disticul ty PostGrad; 
2 eM tive ficul ty Grad; 
ee Entry edit ficulLty High School; 
iG eee Entry. di fticul ty Elementary; 
end; 
Repeat 
Page; Writeln; Writeln; Writeln; 
Writeln C'CONCRETENESS ':45); Writeln; Writeln; 
Writeln CEntry.word:45); Writeln; Writeln; 
Writeln €'1) TANGIBLE ':45); 
Writeln €'2) NOT TANGIBLE ':45); 
Read(€c); Readln; Writeln; Writeln (€¢:45); 
were = SI) of Co = '2'); 
if €c = '1') then Entry.concreteness 
else Entry.concreteness 


Intangible 
Tangible; 


Repeat 
Page; Writeln; Writeln; Writeln; 
Writeln C'EMOTIONAL CONNOTATION ':45); Wraiteln; Writeln; 
Writeln CEntry.word:45); Writeln; Writeln; 


Writeln €'1) SUBLIME *:45); 
Writeln €'2) PLEASANT ':45); 
Writeln €'3) NUETRAL ':45);3 
Writeln €'G) UNPLEASANT! :45); 
Writeln €'5) HORRID ":45); 


Read(c); Readln; Writeln; Writeln (¢:45); 
Umtil Cc = '1!') or Ce = *'2') or Ce = §3!') or Cc = §4')orle = '5!'); 


case ¢c o 
‘1! : Entry.emotion := Sublime; 
f2' : Entry.emotion := Pleasant; 
'3* ;: Entry.emotion := Neutral; 
'G' : Entry.emotion := Unpleasant; 
a : Entry.emotion := Horrid; 
end; 


oS 


Repeat 
Page; Writeln; Writeln; Writeln; 
Writeln C'VIGOR':45); Wraiteln; Writeln; 
Writeln CEntry.word:45); Writeln; Writeln; 
Writein €'1) VIOLENT : 3 
Writein €'2) ENERGETIC ':45); 
Writeln (€'3) CALM ms ; 
Writeln €'S) NONE a5): 
Read (c); Writeln (€¢:45); Readin; 
Until €e = '1') or Ce = "2') or Ce = "3! Ge Ceee er 
Casec (On 


"l' : Entry.vigor == Violen 

'2' : Entry.vigor <= Energetize,; 

es): “Entry. vigor = =) calm. 

'46" : Entry.vigor <= inere. 
end; 


end; 


procedure Write_Concordance; 
begin 
Page; Writeln; Writelin; Writelin; Writeln; 
Write C'Stylist 1s now writing the concordance for '); 
Writeln CInfo.Name_of_text); 
Wratelin; Writeln; 
Writelin C€'Concordance will be written to "Conerdne text a™':50); 
Wraiteln; Writelin; 
Writeln (CConerdnc); Writein CConcrdne); 
Writein (CConerdnce, ‘CONCORDANCE! :55); 
Mritein (€Conerdnc, Info.Name_of_text: 55); 
Writelin CConerdnc); Wraitein (Concerdne), 
Inorder_Concordance (Root); 
end; 
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procedure Inorder_Concordance (var Root : entrypointer) ; 
x 


This 1s the most complex procedure in the program. It traverses *) 
oe the Dixonary Binary Search Tree (BST) in inorder fashion, this *) 
(x visiting each important word used in the Intext in alphabetic *) 
(x order. During the visit to each word, it goes down the concord x) 
( list one at a time, thus examining each use of that word in the x) 
( x order it was used in the Intext. At each visit down the concord *) 
(x list, it follows the Lastward linked list to find the phrase in  ¥*) 
(x the Intext that preceded that instance of the use of the word, ¥) 


( x prints the phrase,capitalizes and prints the word, then follows *) 
(x the Nextward linked list to find and print the phrase following. xX) 


var Lastward, (*¥ pointer to word previous in intext *) 
Nextward, (¥ pointer to word following in intext) 
Downward : concordpointer; (* pointer to next use of same word x) 
Phrasel, (* phrase preceding key word in concord*®) 
Phrase2 : Phrasetype; (* phrase following Key word in concord) 
W> (*¥ # of word being printed *) 
iS (* # of letter in word being printed *X) 
offset, 
1 : integer; 
en: char; 

begin 


Srgset := ord ('A') - ord ('a'); 
1f €CRoot <> nil) then begin 
Inorder_Concordance (Root-.left); 
Downward := Root-.down; 
While eeennneng <> nil) do begin 
W 3: ; 
Lastward := Downward-.last; 
while Seas ote <> nil) and (w < Phraselength) do begin 
Ws=>w t ; 
Phrasel(€.w.) := Lastward-.up~.word; 
Lastward := Lastward-.last; 
end; 
for 1 := w downto 1 do begin 


whale C1 <= wordlength) do begin 
if (Phrasel(.1.)¢.1.) <> * ') then 
Write (Conerdnc, Phrasel(.i1.)¢€.1.)); 


1 := 1 + 1; 
end; 
Write (Conerdne, !§ ':1);3 
end; 
Write (Conerdnc, ! ae 
ie. = 1; 
while (1 <= wordlength) do begin 
Cut =eRoot= wordt. .); 


Pemecerm (aa. ,'2'..))- then 
Ce= chrvord(c) + offset); 

if CRoot~.word(.1.) <> ' ') then 
Write (Conerdnc, c); 


ties te); 
end; 
Write (Conerdnc, ! se 
w := Q; 
Nextward := Downward-.next; 


while (Nextward <> nil) and 
(w < Phraselength) do begin 


wsi=wt 1; 
Prrase2(.w.) := Nextward-.up-.word; 
Peoacnerd >= Nextward-.next; 

end; 
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for 1 := lbote wide beara 


1 := 1; 
while €1 <= wordlength) do begin 
if €PhraseZ(.1.)0.1.) <> © then 
Write CConerdne, PhArase2(.1.7@. eo. 


12 = Loop 
end; 
Write CConerdnc, % 7.) 
end; 
Writeln C€Conerdnc); 
Downward := Downward-. down; 
end; 


Inorder_Concordance (Root-.right); 
end; (X if Roet <> mal x) 
end; (* procedure Inorder_Concordance X) 
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procedure Calculate_Profile; 


var numsigltrs : integer; (¥ number of significant letters *) 
numsigwrds : integer; (¥ number of Significant words xX) 
(*¥ in this context, structureal words are insignificant X) 
begin 
if (tally.totalsentences = 0) then tally.totalsentences := 1; 
if (tally.totalwords = 0) then tally.totalwords := 1; 
numsigwrds := tally.totalwords - tally.structurewords; 
numsigltrs := tally.totalletters - tally.totalstrucltrs; 


Profile := tally; 
profile.Ave_ltrs_per_word := numsigltrs/numsigwrds; 
profile.Ave_wrds_per_sent := tally.totalwords/’tally.totalsentences; 


end; 


procedure Analyze_Profile_and_Report; (% main Reporter procedure X) 
begin 
Page; Writeln; Writeln; Writeln; Writeln; 
Wrate C'Stylist is now analyzing the style of '); 
Writeln CInfo.Name_of_text); 
Wrateln; Writeln; 
Wraiteln €'Report of analysis will be written to "Report text a™':50); 
Wraiteln; Writeln; 
Wriateln CReport); Wraiteln CReport); 
Wrateln CReport); Writeln CReport); 
Calculate_Size_of_Text (profile, analysis); 
Calculate _Length_of_Words (profile, analysis); 
Calculate_Length_of_Sentences (profile, analysis); 
Calculate_Etymology (profile, analysis); 
Calculate_Difficulty_of_Vocabulary (profile, analysis); 
Calculate_Tangibility (profile, analysis); 
Calculate_Emotional_Tone (profile, analysis); 
Calculate_Vigor_of_Words (Cprofile, analysis); 
pane cconmendatzons Canalysis); 
end; 
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procedure Calculate_Size_of_Text (profile : profiletype; 
var analysis : analysistype); 


Var Aes 
PerCentFound : integer; 
begin 
Writeln(€Report); Writeln(Report); Writeln(€Report); 
Write CReport, ‘Total of sentences : '); 
Writeln (Report, profile.Totalsentences) ; 
Write (Report, ‘Total of words - Soy 
Writeln (Report, profile.Totalwords); 
Mrite CReport, 'Total of letters a fg 


Writeln CReport, profile.totalletters); 
Writeln CReport); Writeln€Report); WritelnCReport) ; 
if €profile.totalwords < 500) then begin 
Write (Report, Profile.totalwords, ' words are '); 
Soe (Report, ‘too few for valid statistical analysis.'); 
en 
else if Cprofile.totalwords < 1000) then begin 
Write €Report, Profile.totalwords, ' words are'); 
aacern (Report, ' enough for valid statistical analysis.'); 
en 
else begin 
Mrite CReport, Profile.totalwords, ' words are '); 
Wrateln CReport, ' plenty for valid statistical analysis.'); 
end; 
Writeln CReport); Writeln€Report); Writeln€Report) ; 
Write CReport, ‘Of the ', profile.totalwords:6, * words in '); 
Writeln (Report, Info.Name_of_Text, ','); 
Write CReport, profile.numfound:6); 
Writeln (CReport,' were matched to words in the Stylist dictionary.'); 
percentfound := profile.numfound X*® 100 div profile.totalwords; 
Writeln CReport); 
Writeln CReport, Percentfound:2, ' % were matched. '); 
if €percentfound < 50) then begin 
Nrite CReport, Info.Name_of_text); 
Hraiteln CReport, ' must contain many specialty or unique words.'); 
Wraiteln CReport, 'Statistical analysis 1s not valid.'); 
end 
else Writeln (Report, 'This is enough for statistical analysis.'); 
if Cprofile.newwords > 0) then begin 
Write CReport, 'You added ', profile.newwords); 
Writeln CReport,' to the dictionary during this session.'); 
end; 
end; (xX procedure Calculate Size of intext xX) 
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procedure Calculate_Length_of_Words (profile : profiletype; 
var analysis : analysistype); 
begin 
Graph_LtrsPerWord (profile.ltrsperword); 
Writeln (Report); Writeln (Report); 
Write (Report, 'The average number of letters per word eel): 
Writeln (Report, profile.Ave_ltrs_per_word); 
Writeln(€Report); Writeln(€Report); 
if (Info.Genre = NonFiction) then begin 
Writeln CReport, 'A typical nonfiction texts distribution '); 
Write (Report, ‘resembles a low bell-shaped curve '); 
Writeln (Report, ‘centered around six letters/word.'); 
if €profile.Ave_Ltrs_Per_Word > 7) then 


Analysis.WordLength := Toolong 

else if (profile.Ave_Ltrs_Per_Word > 6.5) then 
Analysis.WordLength := Long 

else 17 (Cprofile.Ave_Ltrs_Per_Word > 5.5) then 
Analysis.WordLength := medium 

else if (profile.Ave_Ltrs_Per_Word > 5) then 
Analysis.WordLength := short 

else Analysis.WordLength := tooshort 


end 

else begin (* genre is fiction xX) 
Mawcelm (Report, "A typical fretion texts distribution '); 
Write (Report, ' resembles a tall bell-shaped curve '); 
Writeln (Report, ‘centered around five letters/word.'); 
if (profile.Ave_Ltrs_Per_Word > 6) then 


Analysis.WordLength := Toolong 

else if (Cprofile.Ave_Ltrs_Per_Word > 5.5) then 
Analysis.WordLength := Long 

else if (profile.Ave_Ltrs_Per_Word > 5) then 
Analysis.WordLength := medium 

else if (profile.Ave_Ltrs_Per_Word > 4) then 
Analysis.WordLength := short 

else Analysis.WordLength := tooshort; 


end; 
Writeln CReport); 
Write (Report, 'The length of the words is '); 
case Analysis.WordLength of 
iooshort : Writeln CReport, ‘too short.'); 
Short : Writeln (Report, 'short.'); 
medium : Writeln Creport, 'medium'); 
Long : Writeln (Report, ‘long.'); 
Toolong : Writeln (CReport, ‘too long.'); 
end; 
Writeln (Report); Writeln€ Report); Writeln(Report); 
end; (X® calculate length of words xX) 
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procedure Graph_Ltrsperword (ltrsperword : ltrsperwordtype) ; 


var perltrwords : ltrsperwordtype; 
| a ee integer; 
begin 


Page (Report); 

Writeln(Report); Writeln(Report); Wrateln(Report); 

Writeln (Report, BREAKDOWN OF PERCENT OF LETTERS PER WORD':60); 
Writeln(Report); Writeln(CReport) ; 


for 13> 0 <oewomdvencnn do 
PerLtrWords(€.1.) := (CLtrsPerWord(.1.)*100) div 
Ctally.totalwords -tally.structurewords) ; 
fori i <= 15 downtoel@ado begin 
Write CReport,ix2:2, ' 74); 
TOr 3) = eto wordlength do 


if (PerLtrWords(. 2: } >= 1 ¥2) then 
Write (Report,' xX on) 

else if (PerltrWords(. J. ) >= (1¥2)-1) then 
Write (CReport,' ; ; 5o 

else 
Write (Report, '! 5s 


Writeln (Report); 
end; 
Write (€Report, ! :4); 
for i = 12 to jordans do 
Write CReport,1:5); 
Writeln (Report); 
Writeln (Report, 'Number of Letters in a Word!:45); 
end; 
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procedure Calculate_Length_of_Sentences (profile : profiletype; 
var analysis : analysistype); 


var 
1» 
Sweetspots, 
Runons, 
PerCentRunons, 
PercentSweetSpots : integer; 
begin 
Graph_WordsPerSent (profile.WordsPerSent) ; 
Write (Report, 'The average number of words per sentence int); 


Writeln (Report, profile.Ave_wrds_per_sent); 
Writeln(Report); Writeln(CReport); Writeln(CReport); 
Write CReport, ‘A typical modern texts sentences average '); 
Nriteln (CReport, ' between fifteen and twenty words.'); 
1f (profile.ave_wrds_per_sent > 22) then 
Analysis.SentLength := Toolong 
else if (profile.ave_wrds_per_sent > 18) then 
Analysis.SentLength := Long 
else 1f (profile.ave_wrds_per_sent > 15) then 
Analysis.SentLength := Medium 
else if (profile.ave_wrds_per_sent > 10) then 
Analysis.SentLength := Short 
else Analysis.SentLength := TooShort; 
Writeln CReport); 
Write (Report, ‘Sentences are '); 
case analysis.SentLength of 
moon neorte- wNritelm CReport, ‘too short. '*"); 
mort: Writeln CReport, ‘short.'); 
medium : Writeln Creport, ‘medium'); 
Long : Writeln CReport, ‘long.'); 
mooaltong =: Writeln CReport, ‘too long.'); 


end; 
Sweetspots := 0; 
Runons := 0; 
for 1 := ] to profile.totalsentences do 
if (Cprofile.wordspersent(.i.) > 45) then 


runons := runons + 1] 
else 1f (Cprofile.wordspersent(.1.) > 8) and 
Cprofile.wordspersent(.1.) < 20) then 


sweetspots := sweetspots + 1; 
PerCentRunons := runons*100 div profile.totalsentences; 
if (PercentRunOns > 5) then Analysis.Runons := unacceptable 
else if (PercentRunOns > 0) then Analysis.Runons := acceptable 
else Analysis.Runons := nonexistant; 
Writeln CReport); 
Wraiteln CReport, ‘Number of run ons ', runons:4); 


Writeln Creport, ‘Percent of run ons ', percentrunons: 4); 
Writeln CReport); 
Write (Report, 'Run ons are '); 
case Analysis.Runons of 
Nonexistant : Writeln (Report, 'nonexistant.'); 
Acceptable : Writeln (Report, ‘acceptable.'); 
Ree eP sable : Writeln Creport, ‘unacceptable'); 
end; 
PerCentSweetSpots := sweetspots*100 div profile.totalsentences; 
if (PercentSweetspots > 50) then Analysis.Modulation := Good 
else if (PercentSweetspots > 20) then Analysis.Modulation := Average 
else Analysis.Modulation := bad; 
Wraiteln (CReport); 
Writeln (Report, ‘Number of medium length sentences ', 
Write (Report, ‘Percent of medium length sentences '); 
Writeln (Report, percentsweetspots: 4); 
Writeln CReport); 


sweetspots: 4); 
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Write (Report, 'Modulation is '); 
case Analysis.Modulation of 
Good : Writeln (Report, ' good.'); 
Average : Writeln (Report, 'average'); 
Bad : Writeln (report, ‘bad'); 
end; 
end; (X* procedure analyze length of sentences Xx) 
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procedure Graph_WordsPerSent (Graph : WordsPerSenttype); 
var i, j, pagenum : integer; 
begin 
Pagenum := 1; 
While (Tally.totalsentences > (Pagenum - 1) * 70) do begin 
Page (Report); 
Writeln CReport,'BREAKDOWN OF NUMBER OF WORDS PER SENTENCE':60); 
Writeln CReport,'Number of words'); 


for i := 1 * (€CPagenum - 1) * 70) to 70 * CPagenum) do 
Graph(.i.) := (Graph€.1.) div 2); 

for i := 50 downto 1 do begin 
Write CReport,1%2:2, ' *2 105 
for j := 1 *® (CPagenum -1) * 70) to 70 * (CPagenum) do 


if (Graph(€.3.) >= 1) then 
Write (Report, 'X':1) else Write (Report, ' ':1); 
Writeln CReport); 


end; 
Write (Report, ' '); 
Tor 1 <= |] to 7 do 


Write (Report, 1 * 10 + (€CPagenum - 1) * 70):10); 
Writeln CReport); 
Writeln (Report, ‘Sentence Number':45); 
Writeln CReport); 
Pagenum := Pagenum + 1; 
end; 
end; 


procedure Calculate_Etymology (profile : profiletype; 
var analysis : analysistype); 
begin 
Nriteln (Report); Writeln (Report); WritelnCReport); 
Writeln (Report, ‘ETYMOLOGY OF WORDS':45); 
Writeln€Report); WritelnCReport); 


Wirite (Report, ‘Number of Latinate words tr: 
Writeln (Report, profile.NumLatinate); 
Write (Report, ‘Number of Germanic words et ):. 


Writeln (Report, profile.NumGermanic) ; 
Writeln (Report); 
if (Info.Genre = NonFiction) then begin 
if Cprofile.NumLatinate > profile.NumGermanic * 3) then 


Analysis.Etymology := TooBorrowed 

else if Cprofile.NumLatinate > profile.NumGermanic X¥* 2) then 
Analysis.Etymology := Borrowed 

else if Cprofile.NumLatinate > profile.NumGermanic) then 
Analysis.Etymology := Mixed 

else if Cprofile.NumLatinate > profile.NumGermanic * 0.9) then 
Analysis.Etymology := Native 

else Analysis.Etymology := TooNative; 


end 
else (*% genre = Fiction *) begin 
if €profile.NumLatinate > profile.NumGermanic X* 2) then 


Analysis.Etymology := TooBorrowed 
else if (Cprofile.NumLatinate > profile.NumGermanic * 1.5) then 
Analysis.Etymology := Borrowed 


else 1f Cprofile.NumLatinate > profile.NumGermanic) then 
Analysis.Etymology := Mixed 
else if (Cprofile.NumLatinate > profile.NumGermanic * 0.5) then 
Analysis.Etymology := Native 
_° Analysis.Etymology := TooNative; 
end; 
Write (Report, 'Etymology is '); 
case Analysis.Etymology of 
TooBorrowed : Writeln (Report, ‘very borrowed.'); 
Borrowed : Writeln (Report, ‘borrowed.'); 
Mixed : Writeln (report, 'mixed'); 
Native : Writeln (Report, ‘native.'); 
migctative : Writeln (Report, ‘very native.'); 
end; 
end; (% procedure calculate etymology *) 
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procedure Calculate_Difficulty_of_Vocabulary (profile 
var analysis 


profiletype; 
analysistype); 
var ; 
PerCentPostGrad, 


end; 


PerCentGrad, 
PerCentHigh integer; 
begin 


Writeln CReport); 


Writeln (Report, 
Writeln(CReport); 
Write (Report, ! 
Writeln CReport, 
Write (Report, '! 
Mriteln CReport, 
Write CKeport, ‘' 
Writeln CReport, 
Write (Report, ' 
Writeln CReport, 
WritelaCReport); 
percentPostgrad 
percentGrad := 


Writeln€Report); WritelnCReport) ; 
"DIFFICULTY OF VOCABULARY':45); 
Writeln(Report); 


PostGraduate difficulty Ls 
profile.NumPostGrad); 

Graduate difficulty ee; 
profile.NumGrad); 

High Sehool difticulzy, or; 
profile.NumHigh_School ); 
Elementary difficulty F 


profile.NumElementary) ; 
Writeln@CReport); 
= profile.numpostgrad*100 div profile.numfound; 


profile.numgrad*100 div profile.numfound; 


percenthigh := profile.numhigh_school*100 div profile.numfound; 
if CpercentPostgrad > 0) then analysis.HardWords := veryhard 
else if Cpercentgrad > 5) then analysis.HardWords := hard 
else if Cpercenthigh > 10) then analysis.HardWords := challenging 
else analysis.HardWords := easy; 

Write (Report, ‘Percent of Postgraduate difficulty '); 


Writeln CReport, 
Writeln (CReport, 
Writeln CReport, 
Writeln€CReport); 
Write CReport, ' 


percentpostgrad); 

'Percent of Graduate difficulty 
"Percent of High School difficulty * 
Writeln@CReport) ; 

Da fficulty-is 9, 


t 
? 


percentgrad); 
percenthigh); 


? 


case Analysis.HardWords of 


Veryhard: Wri 


hard: Writeln (Report, 


challenging: 

easy Writel 
end; 

(x 


teln CReport, ‘very hard'); 
‘hard'); 

Writeln Creport, 'challenging'); 
n CReport, ‘teasy.'); 


Calculate _Difficulty_of_Vocabulary *) 
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procedure Calculate_Tangibility (profile : profiletype; 


var analysis : analysistype); 


begin 
Writeln (Report); Writeln€Report); Writeln(€Report); 
Writeln (Report, "TANGIBILITY':45); 
MritelnCReport); WritelnCReport); 
Write (Report, ' Number of Tangible words a 
Writeln CReport, profile.NumTangible) ; 
Write (Report, ' Number of Intangible words §:'); 
Writeln CReport, profile.NumIntangible); 
if CInfo.Genre = NonFiction) then begin 
if (profile.NumIntangible > profile.NumTangible *® 4) 


Analysis.Tangibility := Soft 
else if Cprofile.NumIntangible > profile.NumTangible 
Analysis.Tangibility := Firm 
else Analysis.Tangibility := Solid; 
end 
else (*® genre = Fiction *) begin 
if (profile.NumIntangible > profile.NumTangible X® 3) 
Analysis.Tangibility := Soft 
else if (Cprofile.NumIntangible > profile.NumTangible 
Analysis.Tangibility := Firm 
else Analysis.Tangibility := Solid; 


end; 
Writeln CReport); 
Write CReport, ‘Tangibility is '); 
case Analysis.tangibility of 
Solid : Writeln CReport, ' very tangibible.'); 


Farm : Writelm (Report, ' tangibible. Ww; 
Soft : Writein CReport, ' very intangibible.'); 
end; 


end; (*® calculate tangibility xX) 
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then 
x2) then 


then 
Xi) then 


procedure Calculate_Emotional_Tone (profile : profiletype; 
var Analysis : analysistype); 
var 
PerCentSublime, 
PerCentPleasant, 
PerCentUnpleasant, 
PerCentHorrid, 
Emotionindex : integer; 
begin 
Writeln (€CReport); Writeln€Report); Writeln(€Report); 
MHriteln CReport, ‘EMOTIONAL CONNOTATIONS' :45); 
Writeln€Report); WritelnCReport); 


Write (Report, ' Sublime connotations : ©OG 
Writeln (Report, profile.NumSublime); 

Write (Report, ' Pleasant connotations : §)2 
Writeln CReport, profile.Numpleasant) ; 

Write (Report, ' Neutral connotations 32S) 
Writeln CReport, profile.NumNeutral); 

Write CReport, § Unpleasant connotations : '); 
Hriteln (Report, profile.Numunpleasant); 

Write (Report, ' Horrid connotations ey) 


Writeln CReport, profile.NumHorrid); 
Writeln€Report); Writeln(€Report); Writeln (Report); 


PercentSublime := profile.numsublimeX*l00 div profile.numfound; 
percentPleasant := profile.numpleasantxX1l00 div profile.numfound; 
percentUnpleasant := profile.numunpleasant*1l100 div profile.numfound; 
percenthorrid := profile.numhorridx*100 div profile.numfound; 
EmotionIndex := (percentsublime X* 5) + 


(percentpleasant * 2) + 
(percentunpleasant * 2) + 
(percenthorrid X* 5); 


1f (EmotionIndex > 20) then Analysis.emotionality := rich 

else if (EmotionIndex > 10) then Analysis.emotionality := standard 

else Analysis.emotionality := poor; 

Writeln (Report, ‘Percent of sublime connotations ',PercentSublime) ; 
Writeln CReport, ‘Percent of pleasant connotations ',percentPleasant); 
Write (Report, 'Percent of unpleasant connotations '); 

Writeln (Report, percentUnpleasant); 

Weriteln CReport, ‘Percent of horrid connotations ',percenthorrid); 
Writeln CReport); 

Writeln (Report, tIndex of Emotionality ', EmotionIndex) ; 


Writeln CReport); 
Vrite CReport, 'Emotionality is '); 
case Analysis.Emotionality of 
Rich: Writeln (Report, ‘'Rich'); 
Standard : Writeln (Report, ‘average'); 
Poor: Writeln Creport, ‘'poor'); 
end; 
if ((€Cpercentsublime * 5) tpercentpleasant) > 
({percenthorrid * 5) + percentunpleasant) * 1.2) then 
Analysis.Tone := positive 
else if (C€€percentsublime * 5) +percentpleasant) *¥ 1.2 < 
(Cpercenthorrid * 5) + percentunpleasant)) then 
Analysis.Tone := negative 
else Analysis.Tone := bland; 
Writeln CReport); 
Write (Report, ‘Tone is '); 
case Analysis.Tone of 
Positive: Writeln (Report, 'Positive'); 
Bland: if Canalysis.emotionality= rich) then begin 
Write (Report, ‘ta balance of strong positive '); 
Writeln (Report, ‘and strong negative emotions.'); 
end else Writeln (Report, 'bland.'); 
eve ee Writeln Creport, 'negative'); 
end; 
end; (€* procedure calculate emotional tone xX) 
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procedure Calculate_Vigor_of_Words (profile : profiletype; 
var analysis : analysistype); 


var 
PerCentViolent, 
PercentEnergetic, 
PercentCalm, 
StrengthIndex : integer; 
begin 


Writeln CReport); Writeln€Report); Writeln€Report); 
Writeln CReport, ‘VIGOR OF WORDS!:45); 
WritelnCReport); WritelnCReport); 


Write CReport, * Words of Extreme Vigor : '); 
Writeln CReport, profile.NumViolent); Writeln; 
WriteCReport, ' Words of Much Vigor SD ee 
Writeln CReport, profile.NumEnergetic); 

Write CReport, ' Words of Some Vigor 7); 
Wraiteln CReport, profile.NumCalm); 

Write CReport, ' Words of Little Vigor cee oe. 


Writeln CReport, profile.numinert); 
WritelnCReport); Writeln€Report); Writeln(CReport); 


percentViolent := profile.numviolentx*100 div profile.numfound; 
percentEnergetic := profile.numenergeticxXl100 div profile.numfound; 
percentCalm := profile.numcalm*100 div profile.numfound; 
StrengthIndex := percentviolent ¥* 10 + 

percentenergetic * 5 + 

percentcalm; 
if (StrengthIndex> 60) then Analysis.Strength := VeryStrong 
else if (StrengthIndex > 50) then Analysis.Strength := strong 
else if (StrengthIndex > 20) then Analysis.Strength := lively 
else Analysis.Strength := weak; 
Write (Report, 'Percent of words of extreme vigor : '); 
Mriteln CReport, percentViolent) ; 
Nrite CReport, 'Percent of words of much vigor : '); 
Writeln CReport, percentEnergetic); 
Write (Report, 'Percent of words of some vigor 2 SOR 


Writeln CReport, percentCalm); 
Writeln CReport); 
Writeln CReport, ‘Index of Vigor ', StrengthIndex); 
Writeln€Report); 
Write CReport, 'Vigor is '); 
case Analysis.Strength of 
VeryStrong: Writeln CReport, ‘very strong'); 
strong: Writeln CReport, ‘strong'); 
lively: Writeln Creport, ‘lively'); 
weak : Writeln CReport, ‘weak.'); 
end; 
end; (€X% procedure calculate vigor of words *) 


aeeoure Write NumRecom (Var Numrecom : integer); 
egin 
NumRecom := NumRecom + 1; 
Writeln€Report); Writeln(CReport); 
Writeln CReport, ‘RECOMMENDATION NUMBER ':55, NumRecom:2); 
econ Report); WritelnCReport); 
end; 
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procedure Make_Recommendations (analysis : analysistype); 


var NumRecom : integer; (* # of recommendations made so far *) 
begin 

Writeln CReport); Writeln CReport); 

NumRecom := Q; 


with analysis do begin 
if (Runons = Unacceptable) then begin 
Write _NumRecom (NumRecom); 
Writeln (€CReport, ‘You tend to write run-on sentences.'); 
Writeln (€Report, 'Check your longest sentences for run ons.'); 
Writeln (Report, 'Break them up into units of single ideas.'); 
end; 
if (SentLength = Toolong) and (CHardWords > Easy) then begin 
Write _NumRecom (CNumRecom); 
Write (Report, ‘Your average sentences are too long '); 
Wrateln CReport, ‘for the difficulty of your vocabulary. ') 
Writeln (Report, ‘Use simpler words or shorter sentences. ' 
end; 
if (Modulation = Bad) then begin 
Write_NumRecom CNumRecom) ; 
Write (Report, 'Your sentences tend to be too short or too'); 
Writeln (Report, ‘long. Try to moderate and modulate '); 
F Writeln (Report, ‘the length of your sentences.'); 
end; 
if CEtymology = TooBorrowed) then begin 
Write _NumRecom CNumRecom); 
Writeln (Report, ‘Use shorter, more native English words.'); 
end; 
1f CEmotionality = Poor) then begin 
Write _NumRecom (CNumRecom) ; 
Writeln €CReport, ‘Use more words that provoke emotions.'); 
end; 
if CInfo.Genre = Fiction) then begin 
if (Strength < Strong) then begin 
Write _NumRecom (CNumRecom) ; 
Writeln (Report, ‘Use more evocative, sensory words.'); 
end; 
if CEtymology < Mixed) and CHardwords > Challenging) and 
(SentLength > Medium) then begin 
Write _NumRecom (CNumRecom) ; 
Wrateln CReport, ‘Your fiction reads like non-fiction. "); 
Writeln CReport, ‘Unless you are targetting a highly '); 
Writeln (Report, ‘literate audience, keep it sSimple.'); 
end; 
if (Tangibility = Soft) then begin 
Write NumRecom (CNumRecom) ; 


) 


Write (Report, 'A narrative should be concrete and '); 
Writeln (Report, ‘detailed. Describe things, not ideas. '); 
end; 


end else begin (*® genre is nonfiction *) 
if CEtymology = TooNative) and (Strength = VeryStrong) then begin 
Write_NumRecom (CNumRecom) ; 


Writeln (Report, 'Your non-fiction reads like fiction.'); 

Write CReport, ‘Ask yourself if it is too vigorous '); 

Writeln CReport, ' for the audience that will read i1t.'); 
end; 


if (Strength = Weak) then begin 
Write _NumRecom (NumRecom) ; 
Writeln (€Report, ‘Use more evocative, sensory words.'); 
end; 
end; 
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if (Strength > Lively) and (CEmotionality > Standard) and 
(Modulation = Good) then begin 


Write (Report, 'Congratulations! You write with '); 
Writeln (CReport, ‘strength and grace. '); 
end; 
1f €Runons = nonexistant) then begin 
Write (Report, 'Congratulations! You never seem '); 


Writeln CReport, 'to write run-on sentences.'); 
end; 
If (NumRecom = 0) then begin 
Writeln CReport); 
Write (Report, 'This is a solid piece of writing '); 
Writeln (Report, ‘well within the traditions of its genre.'); 
Writeln CReport); 
Write (CReport, ‘You are as able to understand the meaning of'); 
Writeln (Report, ' the above characteristics as The Stylist.'); 
Writeln CReport, 'Stand the course!'); 
end; 
end; (€* with analysis do xX) 
end; (X® procedure make recommendations xX) 
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procedure Store_New_Dix; 


begin 
Rewrite (Dixonary, 


Page; Writeln; Writeln; 


"Dixonary text a °o); 


Writeln; 


Writeln; 


Write (€'Stylist 1S now storing the new dictionary. ':55); 


Store_BST GRoot); 
end; 


procedure Store_BST (¢ 
begin 
if Cp <> nll) ethen 


p : entrypointer); 


begin 


if (p-.status = storable) then Store (p-); 
Store BSI Ces. lefc. 
Store BST Cp- rigme): 


end; 
end; 


procedure Store (Entry 


entrytype) ; 


var 
sourceltr, 
a2trreu: tyltr, 
concretenessltr, emotionltr, 
vyigorltr = char; 
begin 
with entry do begin 
if (source = Latinate) then sourceltr := 'l' 
else sourceltr := 'g!; 
case difficulty of 
Postgrad Gifficultyltr  =5'p.; 
Grad Gi fficul ttyl trm.-=. gt, 
High_School Gitrveuwlcvyl er = hs 
Elementary difficultyi tr. :—- Ae; 
end; 
if €concreteness = Tangible) then concretenessltr := 't! 
else concretenessltr := '1'; 
case emotion of 
Sublime emotionltr := 's!; 
Pleasant emotionltr := ‘'p'; 
Neutral emotionltr := 'n'; 
Unpleasant emotionltr := ‘'u'; 
Horrid emotionltr := 'h'; 
end; 
case vigor of 
Violent vVigorltr = tv"; 
Energetic vigorltr := ‘te*; 
Calm vigorltr := tc!; 
Inert ViIgGorlthu:= “ic: 
end; 
end; (*® with entry do X) 
Wraiteln (Dixonary, Entry.word, sourceltr, difficultyltr, 


end; 


concretenessltr, 
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emotionltr, 


vigorltr); 


APPENDIX D 
THE CODE OF DIXSPLIT AND DIXJOIN 


(x NOTE : THE FOLLOWING IS NOT IN EXECUTABLE ORDER. 
IT IS IN THE ORDER OF GREATEST CONCEPTUAL CLARITY. ®) 
(¥$S60000%) 


program Dixsplit (Cainput, output); 
const 

maxsent = 1000; 

wordilength = 15; 

linelength = 60; 

phraselength = 4G; 

freqnum = 50; 

selnum = 200; 


type 
Gemretype = (Nonfiction, Fiction); 
Frequencytype = (Frequently, Seldomly, When_Done); 
Balancetype = (Plus, Zero, Minus); 
Durationtype = (Permanent, Temporary); 
Sourcetype = (Latinate, Germanic); 


Difficultytype = (PostGrad, Grad, High_School, Elementary); 
Concretenesstype = (Tangible, Intangible); 
Emotiontype = (Sublime, Pleasant, Nuetral, Unpleasant, Horrid); 
Vigortype = (Violent, Energetic, Calm, Inert); 
blordtype = packed array (.1..wordlength.) of char; 
Linetype = packed array (.1..linelength.) of char; 
Phrasetype = packed array (.1..phraselength.) of wordtype; 
EntryPointer = -Entrytype; 
ConcordPointer = -Concordtype; 
Entrytype = record 

Word : Wordtype; 

Balance : Balancetype; 

Duration : Durationtype; 

Source : Sourcetype; 

Perr iculty : Daifficultytype; 

Concreteness : Concretenesstype; 

Emotion : Emotiontype; 

waocor : Vigortype; 

Left, Right : EntryPointer; 


Down : Concordpointer; 
end; 
Concordtype = record 

Up : EntryPointer; 

Down, 

Next, Last : ConcordPointer; 
end; 


a1 


var 
FourRoot, 
Root : EntryPointer; 
Lastward : ConcordPointer; 
Latinat, 
Germanc, 
Tangble, 
Intngble, 
rOStaia> 
Graduate, 
HighSchl, 
Elementa, 
Sublme, 
Pleasnt, 
Nuetrl, 
Unpleasn, 
Horrd, 
Violnt; 
Energeti, 
Cin. 
Inte, 
Dixonary : text; 


procedure Attach CEntry : entrytype; var p : entrypointer); 
begin 
New (Cp); 
with p- do begin 
word := entry.word; 
duration := permanent; 
source := entry.source; 
dafficulty += entrevvdliticulty, 
concreteness := entry.concreteness; 
emotion := entry.emotion; 
Vigor := entry.vigor; 
left nil; rieht == nv, 
down Ae 
end; 
end; 
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procedure Encode (its : entrytype; var xsource : char; 
var xdifficulty : char; 
var xconcreteness : char; var xemotion : char; 
var xvigor : char); 


begin 
if (its.source = Latinate) then xsource := 'l' 
else xsource := 'g!; 
case its.difficulty of 
Postgrad : xdifficulty := 'p'; 
Grad s xdifficultye- ‘g*; 
figmmsenool : xdiftreculty ==- hh"; 
Elementary : xdifficulty := 'e'; 
end; 
if (its.concreteness = Tangible) then Xconcreteness := 't! 
else Xconcreteness := '1'; 
case its.emotion of 
Sublime : xemotion := 's'; 
Fleasant : xemotion := 'pt; 
Nuetral >: xemotion := 'n!; 
Unpleasant : xemotion := ‘tut; 
Horrid se xemMouiTon @ = nt. 
end; 
case its.vigor of 
Violent oe “Vigor c=) 'vils 
Energetic >? xXvigor := ‘tet; 
Calm [ x<vidgdor t= "e'; 
Inert [MV PGOr f= "ye: 
end; 
end; 
procedure Decode (var its : entrytype; 
xsource : char; xdifficulty : char; 
xconcreteness : char; xemotion : char; 
xvigor : char); 
begin 
if (Xsource = '1') then its.source := Latinate 
else its.source := Germanic; 
case Moatticulty of 
ee: its.difficulty := Postgrad; 
no: «6.1ts. difficulty := Grad; 
ia: 2ats.difticulty := High _School; 
fe": 1ts.difficulty := Elementary; 
end; 
if (Xconcreteness= 't') then its.concreteness := Tangible 
else its.concreteness := Intangible; 
case Xemotion of 
"st ; its.emotion := Sublime; 
ie’): its.emotion := Pleasant; 
tht : its.emotion := Nuetral; 
Tulfo: ats.emotion := Unpleasant; 
iim: 1ts.emotion := Horrid; 
end; 
case Xvigor of 
aye: 1tSs.vigor := Violent; 
fe’ : its.vigor := Energetic; 
me’)? 1tS.vigor := Calm; 
mies 1tS. vigor := Inert; 
end; 
end; 
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procedure AVL_Insert (Entry : entrytype; var p : Entrypointer; 
var balanced : boolean); 
var 
pl, p2 : entrypointer; 
begin 
if (p = nild then begin 
Attach, CEN<try,) Pp). 
p-.balance := Zero; 
balanced := true; 
end 
else if CEntry.word = p-.word) then Writeln C'INSERT COLLISION!) 
else if CEntry.word < p-.word) then begin 
AVL_Insert CEntry, p-.left, balanced); 
If Balanced then (* left pointer has grown higher *) 
case p-.balance of 


Plus : begin p-.balance := Zero; balanced := false; end; 
Zero : p-.balance := Minus; 
Minus : begin (*® rebalance *) 
pl := p-.left; 
if (pl-.balance = Minus) then begin(€*® single LL rotatxX) 
p-~.left := pl-.right; 
pl right =p, 
p-.balance := Zero; 
7= pl; 


Pp 

end (* if X) 

else begin (*% double LR rotation *X) 
p2 := pl-.right; 
pl=-— fight: =" p2>. left; 
Pez bert <= ipl 


p-.left := p2-.right; 
P27. Fight <= 0p; 
if (€p2-.balance = Minus) then p-.balance := Plus 
else p-.balance := Zero; 
if (€p2-.balance = Plus) then pl-.balance := Minus 
else pl-.balance := Zero; 
BPo:= pe; 
end; (* else X) 
p-.balance := Zero; balanced := false; 


end; (€%*% case of Minus *) 
end; (% of cases *%) 
end (* if Entry.word < p-.word X) 
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else if € Entry.word > p-~.word) then begin 
AVL_Insert CEntry, pr-.right, balanced); 
if balanced then (*® right pointer has grown higher *) 
case p-.balance of 


Minus : begin p-.balance := Zero; 
balanced := false; end; 
Zero : p-.balance := Plus; 
Plus : begin (%® rebalance *) 
pli ps CLent; 


if (pl-.balance = Plus) then begin 
(*¥ single RR *) 


B= loca = pls. left; 

Beara = p; 
.balance := Zero; 
:= pul 


end Cx if =x) 

else begin (% double RL rotation *) 
pZ r= 9 pl-. Left; 
pl-.left := p2-.right; 


pe>- Triont += spl; 
B= . tiogmic.: = ec-.left; 
p2- left <= pp; 
a+ eo balance = Plus) then 
-balance := Manus 
aveee Seer := Zero; 
if €p2-.balance = Minus) then 
pl-~.balance := Plus 
else pl-.balance := Zero; 
p := p2; 
end; (€%® double RL rotation xX) 
-,.balance := Zero; balanced := false; 
end; (® case of balance = Plus *®) 


end; (* of cases *) 
ema) (x of if Entry.word > p-.word *X) 
else balanced := false; 
end; (X of procedure AVL_Insert *) 
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procedure Init_Researcher; 
var 
1, 
stepcount, 
threshhold, 
dixlensth : integer; 
S,», d, c, @, V : char; 
entry : entrytype; 
balanced : boolean; 
begin 
Reset (Dixonary, 'Dixonary text a'); 
New CRoot); 
Readln (CDixonary, Root-~.word, s, d, ¢c, e, v)3 
Decode (Roat-, S,7d-.¢, ev). 
Root-.left := mily Root= rignes- 71 


Root-.down := nil; 
Root-.balance := Zero; 
Root-=.duration := Permanent; 
Lastward := nil; 

Stepcount := 0; 


while not EQF (€Dixonary) do begin 
Readln CDixonary, Entry.word, s, d, cy, e, vw); 
Decode (Entry, s, d, ¢€, e, V2 


Entry.balance := Zero; balanced := false; 
AVL_Insert CEntry, Root, balanced); 
stepcount := stepcount + 1; 


if (€stepcount mod 200 = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 
Writeln C'Stylist is now loading its dictionary.'); 
Writeln C'Last word loaded was ', entry.word:16); Writeln; 
NWriteln C'Stylist has loaded ',stepcount:5,' entries.'); 
Wraiteln; Writeln; 
end; (* if xX) 
end; €® while not EOF xX) 
end; 
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procedure Inorder (var Root : entrypointer); 
begin 
if (Root <> nil) then begin 
Inorder CRoot-~. left); 
case Root-.Source of 
Latinate : Writeln (Latinat, Root-.word, 
Germanic : Writeln (Germanc, Root-.word, 
end; 
case Root-~.Concreteness of 
Tangible : Writeln (Tangble, Root-.word, ‘'t':1); 
Intangible : Writeln CInTngble, Root-.word, ‘'1':1); 
end; 
case Root-.difficulty of 
Postgrad : Writeln (Postgrd, Root-.word, 'p':1); 
Grad : Writeln (Graduate, Root-.word, 'g':1); 
High_School : Writeln CHighSchl, Root-.word, 'h':1); 
: Elementary : Writeln CElementa, Root-.word, ‘e'!:1); 
end; 
case Root-~.emotion of 
Sublime : Writeln (Sublme, Root-.word, ‘'s':1); 
Pleasant : Writeln (Pleasnt, Root-.word, 'p':1); 
Nuetral : Wraiteln (CNuetrl, Root=.word, 'n':1); 
Unpleasant : Writeln CUnpleasn, Root-.word, ‘u':1); 
Horrid : Writeln CHorrd, Root-.word, 'h':1); 
end; 
case Root-.vigor of 
Violent : Writeln (CViolnt, Root-.word, 'v':1); 
Fnergetic : Writeln CEnergeti, Root-~.word, ‘e'); 
Calm : Writeln (Clm, Root-.word, 'c'); 
Inert : Writeln CInrt, Root-~.word, '1!); 
end; 
Inorder CRoot-=.right); 
end; 
end; 


ert) ie: 
Caicos 
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begin (* main program *) 


Init_Researcher; 


Rewrite 
Rewrite 
Rewrite 
Rewrite 


Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Inorder 


end. 


CLatinat, 
(Germanc, 
(Tangble, 
(Intngble, 


(Postgrd, 
(Graduate, 


'Latinat text a'); 

'Germanc text a'); 

"Tangble textmua oy 
'Intngble text a'); 


'Postgord text a'); 
"Graduate text a'); 
CHighSchl,; “*HighSch!) text sa); 
CElementa, '‘Elementa text a'); 
CSublme, 'Sublme text a'); 
CPleasnt, '‘Pleasnt text a'); 
CNuetrl, ‘Nuwetrl text ae‘; 
CUnpleasn, 'Unpleasn text a'); 
CHorrd, ‘Horrd text a4 ov; 
CViolnt, 'Violnt text a‘); 
CEnergeti, ‘Energeti text a'); 
CClm,. "Clim text a‘); 

Cingt,s-- lant Veext 1a 

CRoct); 
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(*$S60000%) 
program Dixjoin (Cinput, output); 
const 

maxsent = 1000; 

wordlength = 15; 

linelength = 60; 

phraselength = 4; 

freqnum = 50; 

selnum = 200; 


type 
Genretype = (Nonfiction, Fiction); 
Frequencytype = (Frequently, Seldomly, When_Done); 
Balancetype = (Plus, Zero, Minus); 
Durationtype = (Permanent, Temporary); 
Sourcetype = (Latinate, Germanic); 


Difficultytype = (PostGrad, Grad, High_School, Elementary) ; 
Concretenesstype = (Tangible, Intangible); 

Emotiontype = (Sublime, Pleasant, Nuetral, Unpleasant, Horrid); 
Vigortype = (Violent, Energetic, Calm, Inert); 


Wordtype = packed array (€.1..wordlength.) of char; 
Linetype = packed array (.1..linelength.) of char; 
Phrasetype = packed array (.1..phraselength.) of wordtype; 


EntryPointer = -~Entrytype; 
ConcordPointer = -Concordtype; 


Entrytype = record 
Word : Wordtype; 
Balance : Balancetype; 
Duration : Durationtype; 
Source : Sourcetype; 
Het riculty : DPifficultytype; 
Concreteness : Concretenesstype; 
Emotion : Emotiontype; 
Vigor : Vigortype; 
Left, Right : EntryPointer; 
Down : Concordpointer; 

end; 


Concordtype = record 

we: EntryPointer; 

Down, 

Next, Last : ConcordPointer; 
end; 
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Vaile 
FourRoot, 
Root = EntryRointer, 
bastward =: Concordrorncer. 


Vatina te, 
Germanc, 
Tangble, 
Intngble, 
Postgrd, 
Graduate, 
Haegimociis 
Elementa, 
Sublme, 

Pleasnt, 

Nuetrl, 
Unpleasn, 

Horrd, 

Violnt, 
Energeti, 

Cin, 

net, 

Dixonary : text; 
word : wordtype; 
Pp : entrypointer; 


6° 7 char, 
procedure Attach (Entry : entrytype; var 
begin 
New (€p); 
with p- do begin 
word := entry.word; 
duration := permanent; 
source := entry.source; 
difficulty .:=" entry wdi ffreul cy; 
concreteness := entry.concreteness; 
emotion := entry.emotion; 
Vigor <= entry.vigor; 
left <= nid; riogme := nil; 
down := nil; 
end; 
end; 


p 


entrypointer); 


procedure Encode (its : entrytype; var xsource : char; 
VanexcartTriculty *« char; 
var xconcreteness : char; var xemotion : char; 
var xvigor : char); 
begin 
if Cits.source = Latinate) then xsource 
else xsource 


i 
ro! ; 


Base its.difficulty of 


Postgrad oexdifricultye:—— pl, 
Grad 2: xdifficulty <= ‘'g*; 
Gigheschool =: xdifficulty <= h'; 
Elementary =: xdifficulty = “e'; 
end; 
if (its.concreteness = Tangible) then Xconcreteness := 't! 
else Xconcreteness := '1'; 
case its.emotion of 
Sublime : xemotion := 's!; 
Pleasant $+ xemotion := 'p?’; 
Nuetral >: xemotion := 'n'; 
Unpleasant : xemotion := ‘ul; 
Horrid >: xemotion := th'; 
end; 
case its.vigor of 
Violent [xXVIgors:= “vl; 
Energetic : xVigor := ‘'e!; 
Calm : xVigor :=""e"; 
Inert xvigers:= "1°; 
end; 
end; 
procedure Decode (var its : entrytype; 
xsource : char; xdifficulty : char; 
xconcreteness : char; xemotion : char; 
xvigor : char); 
begin 
if (Xsource = 'l') then its.source := Latinate 
else its.source := Germanic; 
ease Adifficulty of 
fore: its.difficulty := Postgrad; 
moe: 1ts.difficulty := Grad; 
Mae: ats.difficulty <«= High_School; 
Fe’ : its.difficulty := Elementary; 
end; 
if (Xconcreteness= 't') then its.concreteness := Tangible 
else its.concreteness := Intangible; 
case Xemotion of 
's' ;: its.emotion := Sublime; 
p>’ : its.emotion := Pleasant; 
Tn! : its.emotion := Nuetral; 
fu' : aits.emotion := Unpleasant; 
"h' : its.emotion := Horrid; 
end; 
case Xvigor of 
'v' : its.vigor := Violent; 
'e' : its.vigor := Energetic; 
Mar): 1tS.vigor := Calm; 
mr > Its. vigor := Inert; 
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procedure AVL_Insert CEntry : entrytype; var p : Entrypointer; 
var balanced : boolean); 
var 
pl, p2 : entrypointer; 
begin 
if €p = nil) then begin 
Attach CEntry, 2p0. 
p-.balance := Zero; 
balanced := true; 
end 
else if CEntry.word = p-~.word) then Writeln C'TINSERT COLLISION!) 
else if CEntry.word < p-.word) then begin 
AVL_Insert CEntry, p-.left, balanced); 
If Balanced then (® left pointer has grown higher *X) 
case p-.balance of 


Plus : begin p-.balance := Zero; balanced := false; end; 
Zero : p=.balance := Minus; 
Minus : begin (* rebalance *) 


ol :="p=. left; 
if (€pl-.balance = Minus) then begin(€® single LL rotatx) 
p-.left := pl-.right; 
pl>=. right <= p; 
p-~. balance := Zero; 
p := pl; 
end (XX 1% *) 
else begin (X® double LR rotation xX) 
pe +2 9 pl- origin; 
pl-.right := p2-.left; 
o2=. lef<c 3- spl; 
p- left := p2-.Fright, 


P2e=-fignt. i= Pp; 
if (€p2-.balance = Minus) then p-.balance := Plus 
else p-.balance := Zero; 
if (€p2-.balance = Plus) then pl-.balance := Minus 
else pl-~.balance := Zero; 
p := p23 

end; (*® else *) 

p-~.balance := Zero; balanced := false; 


end; (*® case of Minus *) 
end; (X% of cases xX) 
end (€% if Entry.word < p-.word *X) 
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else if ( Entry.word > p-.word) then begin 
AVL_Insert CEntry, p-.right, balanced); 
if balanced then (*® right pointer has grown higher *) 
case p-.balance of 


Minus : begin p-.balance := Zero; 
balanced := false; end; 
Zero : p-.balance := Plus; 
Plus : begin (X® rebalance *) 
pl := p-.right; 


if (pl-.balance = Plus) then begin 
(*% single RR xX) 


p-~.right := pl-.left; 
pl-.left := p; 
p-.balance := Zero; 


p 3:5 pl; 
end (* if *) 
else begin (* double RL rotation *X) 


pe := pl-.left; 
Pl-=.left := e2-=.right; 
Perens dae += pl; 
DowGlent $= p2e-.lert; 


p2-.left 7= p; 
if (p2-.balance = Plus) then 


p-.balance := Minus 

else p-.balance := Zero; 

if (p2-.balance = Minus) then 
pl-~.balance := Plus 

else pl-.balance := Zero; 


Pp := p2; 
end; (€*® double RL rotation xX) 
p-.balance := Zero; balanced := false; 
end; (* case of balance = Plus xX) 
end; (* of cases *) 
end (* of if Entry.word > p-.word xX) 
else balanced := false; 
end; (* of procedure AVL_Insert *) 
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procedure Init_Researcher; 
var 
cee 
stepcount, 
threshhold, 
dixlength : integer; 
SS; dy c> @€, V = Fenmnar; 
entry : entrytype; 
balanced : boolean; 
begin 
Reset (Dixonary, 'Dixonary text a'); 
New CRoot); 
Readln (Dixonary, Root-.word, s, d, ¢c», e, vj; 
Decode CRoot-, S» ae Cries Vo 
Root-~.left := nil; Root-.right := nil; 


Root~.down := nil; 
Root-~.balance := Zero; 
Root-~.duration := Permanent; 
Lastward := nil; 

Stepcount := 0; 


while not EOF CDixonary) do begin 
Readln (Dixonary, Entry.word, s, d, ¢, e», vj} 
Decode CEntry, s, d,>, ¢c, e, v);3 


cntry.balance := Zero; balanced := false; 
AVL_Insert CEntry, Root, balanced); 
stepcount := stepcount + 1; 


if (€stepcount mod 200 = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 


Writeln C'Stylist is now loading its dictionary. '); 
Writeln (€'Last word loaded was ', entry.word:16); Writeln; 


Writeln C'Stylist has loaded ',stepcount:5,!? 
Writeln; Writeln; 
end; “CX“if 
end; (€* while not EQF X) 


end; 
procedure Inorder (var Root : entrypointer; 
word : wordtype; 
var P : entrypointer); 
begin 
if €Root-~.word = word) then p := root 


else if Croot-.word > word) then Inorder (CRoot-.left, 
else Inorder CRoot-~.right, word, p); 
end; 


procedure Store BS? (Cp ;: entrypointer); 
Var S» ds» Cs @C» V : char; 
begin 
1f €p <> nil) then begin 
Encode (p-, S> d, CC,» @>» vo 
Writeln (Dixonary, p-~.word, S» d, ¢c» @, vj); 
Store Bole (p> ler); 
Store _ BST (p=. right) > 
end; 
end; 


procedure Store_New_Dix; 

begin 
Rewrite (Dixonary, 'Dixonary text a'); 
Store_BST CRoot); 

end; 


entries.'); 


word, p) 


begin (X¥ main program X) 
Init_Researcher; 


Reset (Latinat, ‘Latinat text a‘); 

While not EOF (CLatinat) do begin 
Readin (Latinat, word, co); 
if € ¢ <> 'l') then begin Inorder (Croot, word, p); 
p-.,source := germanic; end; 

end; 


Reset (Germanc, 'Germanc text a'); 

While not EOF (Germanc) do begin 
Readln (Germanc, word, c); 
if € ¢ <> tgt) then begin Inorder Croot, word, p); 
p-.source := latinate; end; 

end; 


Reset (Tangble, '‘Tangble text a'); 

While not EOF (Tangble) do begin 
Readin (Tangble, word, oc); 
iy € ¢ <> *t*) then begin Inorder Croot, word, p); 
P-~.concreteness := intangible; end; 

end; 


Reset (Intngble, ‘Intngble text a'); 
While not EOF CInTngble) do begin 
Readln CIntngoble, word, oc); 
if € ¢ <> '1') then begin Inorder Croot, word, p); 


p-.concreteness := tangible; end; 
end; 


Reset (Postord, ‘'Postord text a'); 
While not EOF (€Postgord) do begin 
Readin (€Postord, word, c); 
1f € ¢ <> tpt) then begin Inorder Croot, word, p); 


mretce = 'o') then p-.difficulty := grad; 

if €c = 'h') then p-.difficulty := high_school; 

if €c = te') then p-.difficulty := elementary; end; 
end; 


Reset (Graduate, ‘Graduate text a'); 
While not EOF (Graduate) do begin 
Readin (Graduate, word, c); 
if € ¢ <> 'g') then begin Inorder Croot, word, p); 


mr Coe = 'p') then p-.difficulty := postgrad; 

fr cc = “h') then p-.difficulty := high school; 

if €c = tet) then p-.difficulty := elementary; end; 
end; 


Reset (HighSchl, 'HighSchl text a‘); 
While not EOF CHaighSchl) do begin 
Readin CHighSchl, word, oc); 
if € ¢ <> th*) then begin Inorder Croot, word, p); 


1f €c = 'p') then p-.difficulty := postgrad; 

if €c = 'gt) then p-.difficulty := grad; 

if €c = tet) then p-.difficulty := elementary; end; 
end; 


Reset (Elementa, ‘Elementa text a'); 
While not EOF CElementa) do begin 
Readln (CElementa, word, oc); 
if € ¢ <> tet) then begin Inorder Croot, word, p); 


mimecc - 'p') then p-.difficulty := postgrad; 
if €c = *g*) then p-.difficulty := grad; 
a Coc = ht) then p-.difficulty := high school; end; 
end; 
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Reset (Sublme, ‘'Sublme text a‘); 
While not EOF (CSublme) do begin 
Readln (Sublme, word, oc); 


if 7 Ge — 's') then begin Inorder (root, word, 


if €c = 'p') then p-.emotion 

ifecer= Bs ) then p-.emotion 

if €c = 'u't) then p-.emotion 

if €c = 'h') then p-.emotion 
end; 


Reset (Pleasnt, 'Pleasnt text a‘) 
vihile not EOF (Pleasnt) do begin 
Readln (Pleasnt, WOrd, <<); 

if € c <> 'p') then begin Inor 


if (ee = "s') then 6-. emotion 

if €e = 'n'? then p-.emation 

if (ec = *utd then -p- emotion 

if Ce = "h') “then p- emotion 
end, 


Reset (Nuetrl, 'Nuetrl text a'); 
While not EOF (Nuetrl) do begin 
Readln (Nuetrl, word, c); 
if € c <> *n') then begin Inor 


ff .Cc- = SI } then p-.emotion 

1f €c = 'p')} then p-.emotion 

if €c = *u') them S- Jemocion 

1f €c = 'h') then p-.emotion 
end; 


Reset (Unpleasn, 'Unpleasn text a 
While not EOF (CUnpleasn) do begin 
Readln (CUnpleasn, word, oc); 


pleasant; 
nuetral; 
unpleasant; 
horrid; end; 


> 


der Croot, word, 
sublime; 
nuetral; 
unpleasant; 
horrid; end; 


der Croot, word, 
sublime; 
pleasant; 
unpleasant; 
horrid; end; 


J 


if € e¢ <> tu’) then begin Inorder Croot, word, 


if Ce.-= si } then p-.emotion 

1f €c = 'p')} then p-=.emotion 

1f €c = 'n') then p-.emotion 

1f €c = 'h') then p-.emotion 
end; 


Reset (Horrd, ‘Horrd text a'); 
While not EOF CHorrd) do begin 
Readln CHorrd, word, oc); 
1f € € <> "R'): then begin Inor 


af Cee = is! } then p-.emotion 

1f €c = 'p')} then p-~.emotion 

1 2Ces = Teer then p-.emotion 

if €c = fut) then p-.emotion 
end; 


Reset (Violnt, "Violmt text a‘); 
While not EOF CViolnt) do begin 
Readln (Violmt, word, 7c); 


sublime; 
pleasant; 
nuetral ; 
horrid; end; 


der Croot, word, 
sublime; 
pleasant; 
nuetral; 


if € c <> ‘v!) then begin Inorder Groot, wore, 


17 Cc = ‘e'") then pr Vigor: 

if €e¢ = *e') then p>. Visere— 

if (c = '1') then p-.viger := 
end; 


Reset (Energeti, 'Energeti text a 
While not EOF CEnergeti) do begin 
Readin (CEnergeti, word, c); 


energetic; 
calm; 
inert; end; 


Oe 


it € ¢ <> te then begin Inornder (root, word, 


if €c = 'v!) then p-. vigor : 5 

if €c¢ = "*c') then p-.vigor =: 

if €c°= 3°) then p=. Vigor = — 
end; 


Violent; 
calm; 
inert; end; 


Pd); 


P); 


PJ; 


pP); 


Pp); 


unpleasant; end; 


pd; 


pP); 


_ —~ 


Reset (Cl 
While not 
Readln 


m, 'Clm text a‘); 
EOF (Clim) do begin 
CGlm, word, c); 


if € ¢ <> '¢') then begin Inorder Croot, word, p); 


Itc 

tc 

mee Ce 
end; 
Reset CIn 


While not 
ReadIin 


'v') then p-~.vigor 
'e') then p-.vigor 
'3') then p-.vigor 


Gtom inrt text at); 
EGE Cinrt) do begin 
CIinrt, word, c); 


Violent; 
energetic; 
inert; end; 


if € ¢ <> '1') then begin Inorder Croot, word, p); 


mr Cc 

ea CC 

a7 CC 
end; 


Store_ 


end. 


'v') then p-~.vigor 
'e') then p-.vigor 
'c!') then p-~.vigor 


oa a 


New_Dix; 
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Vrolenc; 
energetic; 
calm; end; 


APPENDIX E 
REPRESENTATIVE RUNS 


PROETIFE 


Plato's Phaedrus 


Total of sentences : 17 
Total of words : 360 
Total of letters : 1595 


360 words are too few for valid statistical analysis. 


Of the 360 words in Plato's Phaedrus 
312 were matched to words in the Stylist dictionary. 


86 % were matched. 
This 18 enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 


30 
28 
26 
24 
22 
20 
18 
16 
14 


ny FP DBD fw 2 
~~ KM mM KM WM WM MK MK 
SN KM WM MRM WM KK 
Oo «K WK wW 
>< 


6 


X 
K 
x 
x 
x 
x 
X 
X 
X 
4 
r etters in a Wo 


x 
x 
x X 
x x 
x x 
x x 
x xX 
x x 
5 8 

umber of L rd 

The average number of letters per word 6 ol g2657 TE+O0 

A typical nonfiction texts distribution 


resembles a low bell-shaped curve centered around six letters/word. 


The lensth of the words is medium 
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BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 
Number of word 
100 : 


MOONHADNHDWON AXHAWO 


56 


WG 
ts 
PK DK DK OK OK OOK OK OK 


13 KKK XX 

RIOR XK RX 

KAKA KKK ORK 

KAKA KKRK 
ROOK 
KKK ROKK KK eK 
MK KAA RK AK KR 
KAKKKAKKKK KKK 
OOO OC OCR OE POO CEO 4 

10 20 Su G0 50 60 70 
Sentence Number 


NO 
© 
>< 
>< 
< 
>< 
>< 
>< 
>< 
PK OK DK OK OK OK OK O>*K 


ed ee 
NO HWW On HO 


The average number of words per sentence : 2117667059 E.oR 

A typical modern texts sentences average between fifteen and twenty words 
Sentences are long. 

Number of run ons 0 


Percent of run ons 0 
Run ons are nonexistant. 


Number of medium length sentences 9 
Percent of medium length sentences 52 


Modulation 1s good. 
ETYMOLOGY OF WORDS 
Number of Latinate words : Sal 


Number of Germanic words : 95 


Etymology is very native. 


DIGEICULTY GF VOCABULARY 


PostGraduate difficulty : 0 
Graduate difficulty : l 
migim oOchool difficulty : ZS 
Elementary difficulty : 120 
Percent of Postgraduate difficulty 0 
Percent of Graduate difficulty 0 
Percent of High School difficulty 8 


Drnhiculty is easy. 


TANGIBILITY 
Number of Tangible words : 35 
Number of Intangible words : Pit 


Tangibility is tangibible. 


EMOTIONAL CONNOTATIONS 


Sublime connotations : 7 
Pleasant connotations ; GG 
Neutral connotations : 87 
Unpleasant connotations : 8 
Horrid connotations : 0 


Percent of sublime connotations Z 
Percent of pleasant connotations 14 
Percent of unpleasant connotations a 
Percent of horrid connotations 0 
Index of Emotionality G2 
Emotionality is Rich 


Tone is Positive 


VIGOR 


Words of Extreme Vigor 

Words of Much Vigor 

Words of Some Vigor 

Words of Little Vigor 
Percent of words of extreme vigor 
Percent of words of much vigor 
Percent of words of some vigor 
Index of Vigor G9 


Vigor is lively 


Congratulations! 


OF WORDS 


You never seem to write run on sentences. 


This 18 a solid piece of writing well within the traditions of its genre. 


PZ 


CONCORDANCE 


Phaedrus 
them and thamus enquired ABOUT their several uses and 
discovered is not an ArD to memory but to 


other egyptians might be ALLOWED to have the benefit 
an art is not ALWAYS the best judge of 
is called by them AMMON to him came theuth 


learned nothing they will APPEAR to be omniscient and 
censured others as he APPROVED or disapproved of them 

many arts such as ARITHMETIC and calculation and geometry 

or inventor of an ART is not always the 

the inventor of many ARTS such as arithmetic and 

blame of the various ARTS but when they came 

calculation and geometry and ASTRONOMY and draughts and dice 
have been led to ATTRIBUTE to them a quality 

in the learners' souls BECAUSE they will not use 

your own children have BEEN led to attribute to 

allowed to have the BENEFIT of them he enumerated 

1s not always the BEST judge of the utility 

wiser and give them BETTER memories it is a 

name was theuth the BIRD which is called the 

theuth in praise or BLAME of the various arts 

it 1S a specific BOTH for the memory and 

such as arithmetic and CALCULATION and geometry and astronomy 
egypt which the hellenes CAE egyptian thebes and the 


the bird which is CALLED the ibis 1s sacred 

the god himself is CALLED by them ammon to 

them ammon to him CAME theuth and showed his 

arts but when they CAME to letters this said 

a quality which they CANNOT have for this discovery 
some of them and CENSURED others as he approved 

to the external written CHARACTERS and not remember of 
love of your own CHIL DREN have been led to 

socrates at the egyptian Ciley of naucratis there was 
dwelt in that great Cli of upper egypt which 

they will be tiresome COMPANY having the show of 

king of the whole COUNTRY of egypt and he 

discovery of yours will CREATE forgetfulness in the learners’ 
letters now is those DAYS the god thamus was 

and showed his inventions DESIRING that the other egyptians 
astronomy and draughts and DIGE but his great discovery 
as he approved or DISAPPROVED of them it would 

and you give your DISCIPEES not truth but only 

specific which you have DISCOVERED 1s not an aid 

dice but his great DISCOVERY was the use of 

cannot have for this DISCOVERY of yours will create 
geometry and astronomy and DRAUGHTS and dice but his 

of egypt and he DWELT in that great city 

the whole country of EGYPT and he dwelt in 


great city of upper EGYPT which the hellenes call 
Socrates at the EGYPTIAN Ercy Jot maucratis there 
which the hellenes call EGYPTIAN thebes and the god 


desiring that the other EGYPTIANS might be allowed to 

theuth will make the EGYPTIANS wiser and give them 
enumerated them and thamus ENQUIRED about their several uses 
benefit of them he ENUMERATED them and thamus enquired 

will trust to the EXTERNAL written characters and not 
naucratis there was a FAMOUS old god whose name 

you who are the FATHER of letters from the 

of yours will create FORGETFULNESS in the learners! souls 

be omniscient and will GENERALLY know nothing they will 


arithmetic and calculation and GEOMETRY and astronomy and draughts 
the egyptians wiser and GIVE them better memories it 
to reminiscence and you GIVE your disciples not truth 


PS 


was a famous old GOD whose name was theuth 

is those days the GOD thamus was the king 

egyptian thebes and the GOD himself is called by 

and dice but his GREAT discovery was the use 

he dwelt in that GREAT city of upper egypt 

will be tiresome company HAVING the show of wisdom 
truth they will be HEARERS of many things and 

upper egypt which the HEC EENes call egyptian thebes and 
thebes ana the god HIMSELF 1s called by them 

which is called the IBIS 1s sacred to him 

thamus replied o most INGENIOUS theuth the parent or 
them and in this INSTANCE you who are the 

Of tme utility “or TNUTICITY of his own inventions 
theuth and showed his INVENTIONS desiring that the other 
ainutility of his own INVENTIONS to the users of 

and he was the INVENTOR of many arts such 

theuth the parent or INVENTOR of an art 1S 

not always the best JUDGE of the utility on 

god thamus was the KING of the whole country 

omniscient and will generally KNOW nothing they will be 
things and will have LEARNED nothing they will appear 
create forgetfulness in the LEARNERS! souls because they will 
own children have been LED to attribute to them 

was the use of LETTERS now 1s those days 

when they came to LETTERS this said theuth will 

are the father of LETTERS from the paternal love 

it would take a LONG time to repeat all 

letters from the paternal LOVE of your own children 
this said theuth will MAKE the egyptians wiser and 

was the inventor of MANY arts such as arithmetic 

will be hearers of MANY things and will have 

and give them better MEMORIES it 1s a specific 

will not use their MEMORIES they will trust to 

specific both for the MEMORY and for the wit 


not an aid to MEMORY but to reminiscence and 

that the other egyptians MIGHT be allowed to have 

wixt thamus replied o MOST ingenious theuth the parent 
famous old god whose NAME was theuth the bird 

the egyptian city of NAUCRATIS there was a famous 

and will have learned NOTHING they will appear to 

and will generally know NOTHING they will be tiresome 
the wit thamus replied 0 most ingenious theuth the 
there was a famous OLD god whose name was 

will appear to be OMNI SCIENT and will generally Know 
inventions desiring that the OTHER egyptians might be allowed 
of them and censured OTHERS as he approved or 

OF inutilaty of fis OWN inventions to the users 
paternal love of your OWN children have been led 

most ingenious theuth the PARENT or inventor of an 

of letters from the PATERNAL love of your own 

Said to theuth in PRAISE or blame of the 

their several uses and PRAISED some of them and 


attribute to them a QUALITY which they cannot have 

of wisdom without the REALITY 

written characters and not REMEMBER of themselves the specific 
to memory but to REMINISCENCE and you give your 

a long time to REPEAT all that thamus said 


for the wit thamus REPLIED o most ingenious theuth 
called the ibis is SACRED to him and he 
repeat all that thamus SAID to theuth in praise 


came to letters this SAID theuth will make the 
truth but only the SEMBLANCE of truth they will 


thamus enquired about their SEVERAL uses and praised some 
tiresome company having the SHOW of wisdom without the 
him came theuth and SHOWED his inventions desiring that 


SOCRATES at the egyptian city 


several uses and praised SOME of them and censured 


forgetfulness in the learners' SOULS because they will not 
memories it 1s a Srecrric both for the memory 
remember of themselves the SPECIE IC which you have discovered 


of them it would TAKE a long time to 

those days the’ god THAMUS was the king of 

he enumerated them and THAMUS enquired about their several 
to repeat all that THAMUS said to theuth in 

and for the wit THAMUS replied o most ingenious 

the hellenes call egyptian THEBES and the god himself 
and thamus enquired about THEIR several uses and praised 
they will not use THEIR memories they will trust 

and not remember of THEMSELVES the specific which you 
egyptian city of naucratis THERE was a famous old 

god whose name was PHEUTA the bird which is 

ammon to him came THEUTH and showed his inventions 
that thamus said to THEW in praise or blame 

to letters this said THEUTH will make the egyptians 
replied o most ingenious THEUTH the parent or inventor 
be hearers of many THINGS and will have learned 

of letters now 15S THOSE days the god thamus 

would take a long TIME to repeat all that 

nothing they will be TIRESOME company having the show 
their memories they will TRUST to the external written 
give your disciples not TRUTH but only the semblance 
only the semblance of TRUTH they will be hearers 

that great city of UPPER egypt which the hellenes 
great discovery was the USE Of Letters now 15 

because they will not USE their memories they will 

own inventions to the USERS of them and in 

enquired about their several USES and praised some of 
best judge of the UTE r ie Or anutility of his 

or blame of the VARIOUS arts but when they 


the various arts but WHEN they came to letters 

was theuth the bird WHICH 1s called the ibis 

Clty et upper egypt WHICH the hellenes call egyptian 

to them a quality WHICH they cannot have for 

of themselves the specific WHICH you have discovered is 


the king of the WHOLE country of egypt and 
a famous old god WHOSE name was theuth the 
having the show of WISDOM without the reality 


will make the egyptians WISER and give them better 

memory and for the WIT thamus replied o most 

the show of wisdom WITHOUT the reality 

disapproved of them it WOULD take a long time 

trust <o the external WRITTEN characters and not remember 
for this discovery of YOURS will create forgetfulness in 


Student Paper. Example of Poor Writing 


PROFILE 
Total of sentences : 62 
Total of words : 2026 
Total of letters : 10561 
2026 words are plenty for valid statistical analysis. 


Of the 2026 words in Student Paper. Example of Poor Writing , 
1839 were matched to words in the Stylist dictionary. 


90 * were matched. 
This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 


~~ «KM WK WM KR MK 


X 


1 Z 3 
Numbe 


ww OK OPK, OR PS ee 
Oo «Ke WK MWK 
~< 


6 


xX 
x 
x 
x 
x 
x 
x 
X 
5 
Letters in a Wo 


X 

x X 

X X 

6 x x 
G x x x 
2 x X 
G 8 

r Ee 


oT d 


The average number of letters per word f 629218877 13EF00 


A typical nonfiction texts distribution 
resembles a low bell-shaped curve centered around six letters/word. 


The length of the words is long. 


eZ 


l BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 


Number of words 
0 


nv 
98 
96 
94 
oc 
90 
88 
86 
84 
82 
80 
78 
78 x 
4 x 
V2 x 
70 x 
638 x 
66 x 
64 x x x 
62 x x x 
0 x x x 
58 XX x x 
56 XX x x 
54 XXX x x 
52 XXX x x 
50 XXX x x 
G3 XXX x XX 
G6 XXXX x XX X 
GG  XXXX X X XX X 
G2  XXXX X X xX X 
G0 XXXX Aax XX X 
33 XXXX X X XX XK X 
36 XXXKX xX X XX X X X X 
3G  XXXX X X XX XX X X X X 
32 XXXX X X XX XX X X X X 
30 AXX X XXXXX XXXX X X X 
28 XXXX X XXXXX XXXXX X AUK 
26 XXXX X XK XXXXX XXXXX X XX XX 
24 XXXX X X XXXXX XXXXXXX XX XX 


= XXXX XX XX XXXXX XK KXXKXXK KX XX 
20 XXXX XX XK XXXXX XK XXXXKKKX XXKXKXX 
13 XXXX XX XX XXXXX XX XXXKXXXK XXKXX 
16 XXXX XX KX XXXKX KX XXKXXXXXXKXKKXXK 
XXXKX XX XK KXXXXKX XXK XKXXXXXXKXKKXXXK 
XXXX XX XX XKXXKK XXK KKXXXXXKXXXXKKX 


>< >*€ DK DK DK DK OK DK OK DK DK OK DK OK DK DKK DK DEK OK O*EK 


XXX 

XXX X 
XXX X 
XXX X 
XXXXX 
XXXXX 
XXXXX 
XXXXX 
XXXXX 


>< >*K DK DK DK DK DK DK DK DK DK DKK DK OOK DED DK DKK OK OK OK O*K 


XXXXXKX X 
XXXXXXXX 
XXXXKXXX 


XXXXXXKXXKKXXKXXXK 
XXKXXXKXXKKXXKXXXKKXXKKXKKKKKXK 
XXXX XXK XX XXXKX KKXXXKKXXXKXXKXXKKXKXXKKXKKKKXKXKKKKKKKXKXKXKKKXXKXKKKKX 


x 

x 

x 

x 

XX 

XX 

XX 

XX 
x XX 
x XX 
x XX 
XX XX 
XX XX 
XX XX 
XX XXX 
XX XXX 
XX XXX 
XXXXXKXX 
XXXXXXXX 
XXXXXKXX 
XXXXXXKXX 
XXXXKXXXKXKX 
XXXXXXXXKXK 


XXXXXXXXKX 


XXXXXXKXKX XX KXKXXXKXKKXKXXXKKXKKXXKKKKKXXKKKKXXKKKK KX KKXKKKKXKKXKXKKKXKXKKXK 
XXKXXXKXKXXX XXKXXKXXKXKXKXKKKKKKXKXK KKK KKXKAKKKKKKAKKXKKKXKKAKAKXKKXKXKKAXKKXK 
XXXXXKXKXKXXXXKKKKKXK KK KKK XK KKK KKK KKK KKK KKK KKK KK KKK KKK AKKKKKKKKKKK 


10 20 30 


40 


Sentence Number 


The average number of words per sentence 


A typical modern texts sentences average 
Sentences are too long. 


Number of run ons is 
Percent of run ons 20 


Run ons are unacceptable 


between fifteen and twenty words. 


4 
Z 
0 
8 XKXXXXKXXK XK KKXXXXKKXXXKKKKXKXKXKKKKXKXKXKKKXKKKKKKXKKKKXKKKKXKKKKKXKKKKKX 
6 
4 
2 


0 


60 


3.267791 95a E008 


70 


Number of medium length sentences 14 
Percent of medium length sentences Le 


Modulation is average 
ETYMOLOGY OF WORDS 
Number of Latinate words : 708 


Number of Germanic words : 402 


Etymology 1S mixed 


DIFFICULTY OF VOCABULARY 


PostGraduate difficulty : 26 
Graduate difficulty : 141 
eaagm ochool difficulty ; G05 
Elementary difficulty : 538 
Percent of Postgraduate difficulty l 
Percent of Graduate difficulty 7 
Percent of High School difficulty 22 


Difficulty is very hard 


TANGIBILITY 
Number of Tangible words : 343 
Number of Intangible words es: 767 


Tangibility is tangibible. 


EMOTIONAL CONNOTATIONS 


Sublime connotations 
Pleasant connotations 
Neutral connotations 
Unpleasant connotations 
Horrid connotations 


00 
“JON ON 
Mm Uke ON ~J 


Percent of sublime connotations 
Percent of pleasant connotations 
Percent of unpleasant connotations 
Percent of horrid connotations 


onwoo 


Index of Emotionality 26 
Emotionality is Rich 


Tone is Positive 


ig 


VIGOR OF WORDS 


Words of Extreme Vigor =: a 

Words of Much Vigor : 94 

Words of Some Vigor : G64 

Words of Little Vigor : 551 
Percent of words of extreme vigor : 0 
Percent of words of much vigor : 5 
Percent of words of some vigor ; 25 
Index of Vigor 50 


Vigor is lively 


RECOMMENDATION NUMBER 1 


You tend to write run-on sentences. 
Check your longest sentences TOT run ons. 
Break them up into units of single ideas. 


RECOMMENDATION NUMBER 2 


Your average sentences are too long for the difficulty of your vocabulary. 
Use simpler words or shorter sentences. 


120 


Excerpt from A Farewell to Arms 
PROFILE 


Total of sentences : 65 
Total of words : 1349 
Total of letters : 5431 


1349 words are plenty for Valid statistical analysis. 


OT the 13969 words ain Excerpt from A Farewell to Arms 
1197 were matched to words in the Stylist dictionary. 


8&8 % were matched. 
Thais is enough for statistical analysis. 


£21 


BREAKDOWN OF PERCENT OF LETTERS PER WORD 


30 
28 
26 
24 X 
22 X 
20 X 
13 X 
16 X 
14 X 
12 : x 
10 X 
X 
X 
X 
X 
4 
r 


~~ K K KR KR KK 


Nm BH ao 
SN KW WK KOK 


6 
etters ina Wo 


x 
xX 
x 
xX 
x 
X 
x 
x 
x 
x 
x 
xX 
; 10 11 12 Ls 14 i> 


Xx 
X 
x x 
x Xx 
x X x 
3 8 
N of fe 


umbe d 


The average number of letters per word > 5§.100817439E+00 
A typical fietion texts distribution 


resembles a tall bell-shaped curve centered around five letters/word. 


The length of the words is medium 
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1 BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 
Number of words 


IN 

© 
PK KE OK DEK DK OK OK OK OK OK OEE 
>“ >*K OK DK OOK OK DK DK DEK OOK 


xX 
X 
xX 
XX X 
XX X X 
X XX X xX 


xX 

x 

x XX XX XX X 
. XX X XX XX XXX 
x 
x 
x 


NO 
oo 
>< 
>< 
>< 
>< 
>< 0K DK DK OK DK OOK OK OK OE D*K DFE OOK OEE OK OD>*K 


26 XX X XXXX XX 
24 XX X XXXX XX XX 
22 XX X X XXXX XX XX 
29 XXXX XXXXXX XX XX 
18 XXXX XXXXXXXXX XXX 
16 XXXX XXXXXXXXXXK XXXXXX 
14  XXXX XXXXXXXXXXX XXXXXX 


XX X X XX XX XXX 
XX X X XX XXXXXX 
XX X XXXXXXXXXXXKX 
XXX X XXXXXXXXXXXX 


mK 0K OK OK OK OOK 


>< 0K OK OK OK DK 
P< OK DK OK OK OK OK 


>< >< OK OK 


X 
X XX X 
2 XXXX XXXXXXXXXXX XXXXXXKX XXXK XX X X X XXX XXX X XXXXXXXXXXXX 
O XXXX XXXXXXXXXXXXKXXXKXX XXXX XX KXXK KX XXXXX KXX KXXXXXXXXXXXKXXKX 
8 XXXX XXXXXXXXXXXXXXXKXXX XXXXXKX XXXXXXKXKXKXKK XXK KXXXKXXXXXKKXXKKXKX 
; XXXXXXXXXXXXXKXKKKXKAXKXKXKKXKKXXKKXAXKXKXXAXKKXXXKXXKXK XKXXXXXKXXXXXXXKXKXXXXX 
2 


KK OK OK 
>< OK >KEOKEO*EK 


XXXXXXXXXKKXXXKKKXXKXKXXKKKXKXXKKXKXKKKXKXKKXKKXKKKXKKKKXKKXKXKXKKKXAKKKXXKKXK 
AAI OA RAMAN BAALERARAARARAKE 


Sentence Number 


The average number of words per sentence >: 2.075384615E+01 


A typical modern texts sentences average between fifteen and twenty words. 
Sentences are long. 


Number of run ons 5 
Percent of run ons G 


Run ons are acceptable. 


E25 


Number of medium length sentences 26 
Percent of medium length sentences G0 


Modulation is average 
ETYMOLOGY OF WORDS 
Number of Latinate words : or 


Number of Germanic words : G34 


Etymology 18S very native. 


DIFFICULTY OF VOCABULAR, 


PostGraduate difficulty : 0 
Graduate difficulty : 0 
Hign Scnool, dirriculcy : 25 
Elementary difficulty : 596 


Percent of Postgraduate difficulty 
Percent Of Graduate -ditriculty 
Percent of High Sehool “drtficulty 


NOG 


Ditficulty is easy: 


TANGIBILITY 


Number of Tangible words : 179 
Number of Intangible words : G42 


Tangibility is tanoibible. 


EMOTIONAL CONNOTATIONS 


Sublime connotations : 1 
Pleasant connotations : 135 
Neutral connotations : G50 
Unpleasant connotations : 34 
Horrid connotations : 1 


Percent of sublime connotations 

Percent of pleasant connotations i 
Percent of unpleasant connotations 

Percent of horrid connotations 


ONr oO 


Index of Emotionality 26 
Emotionality is Rich 


Tone 1s Positive 
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VIGOR OF WORDS 


Words of Extreme Vigor =: G 

Words of Much Vigor : 73 

Vlords of Some Vigor : 324 

Words of Little Vigor : 220 
Percent of words of extreme vigor : 0 
Percent of words of much vigor : 6 
Percent of words of some vigor : 27 
Index of Vigor 57 


Vigor 1s strong 


This 1s a solid piece of writing well within the traditions of its genre. 
You are as able to understand the meaning of the above characteristics as The St 


Vilist. 
Stand the course! 


12) 


Computer Science Text 1] 


PROFILE 
Total of sentences : 56 
Total of words : 1178 
Total of letters : 5868 
1178 words are Plenty for valid statistical analysis. 


Of the 1178 words in Computer Science Text 1 
1026 were matched to words in the Stylist dictionary. 


87 % were matched. 
This 18S enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 


~ WK XK KR KK 
SN K KK KK KK KK 


oO K K K KR KK 


6 
etters ina Wo 


x 
Xx 
X 
X 
Xx 
S 
x 
; 10 dl 12 iS 14 15 


X 
X 
Xx 
X 
Xx 
8 

of rd 

The average number of letters per word > 6. 500000000E+T00 

A typical nonfiction texts distribution 


resembles a low bell-shaped curve centered around six letters/word. 


The length of the words is medium 


12? 


BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 
Number of words 


100 

98 

96 

94 

ae 

90 

88 

8&6 

84 

$2 

80 

78 

76 
74 
tz 

0 
68 
66 
64 
62 
60 
58 
56 
54 
52 
50 
48 
G6 x 
44 Xx X 
42 X X x 
40 X X x 
20 X X x x 
36 X X x xX x 
34 xX X XX x xX 
52 x X X XX XX x x 
30 x X X XX XX x xX X 
28 x x X X XX XX x X X XXX 
26 x x Xx X xX XX XXX XK X X X X XXX 
24 X X x X X X XX XXX X XxX X X KX XXX 
22 x x x X X xX XX XXX X XX X X X XXX 
c0 XX x Ke KKK CX KX XX XXXX X XX X X X XXX 
18 X XxX X X XX XXXXX XX XX XXXX X XX X X X XXX 
16 XXXXXXX X X XXX XXKXXX KXXKXX XKXK XXXKXKXXXKKKX X X X XXX 
14 XXXXXXX X X XXXXXKXXXX XXXK XXX XKXXXXXXKXKXX KX K XXX XXX 
12 XXXXXXX KXK XKXXKXXKXXXKX KXXXXXXXXXXXXXKKXKXXKKXXK KX XXK XXX 
1Q = XXXXXXX XXX XXXXXXXXXXXXXXXXXXKXKXXXXKXKKKXXXXKXKXXXKX XXXXXXX 
8 XXXXXXX XXX XXXXXXXXXKXXXXXXXXKXXKXKXKKXXXKXXKXKXXXKXKXKX XXXXXXX 
6 XXXXXXX XXX XXXXXXXXKXXXXXKXKXKXKXXKXKKXXKXKXKK XK XKXKXKXKXXXKXXKXKKX 
G  XXXXXXXXXKXXXKXXXXKXKXKXKKXXKXKXKXXKKKKXKKXKKAXKXKKXAKKXKXKKKXKKKKKXKXAKKX 
2 XXXXXXXXXKXXKKKKKXKXKKKKXKXKKKAXXKXKKKKXKXKXKKKXKXKKKKKKKXKKXKKKKAKKKX 


10 20 30 GO 50 60 70 
Sentence Number 


The average number of words per sentence > 2,103571928E+01 


A typical modern texts sentences average between fifteen and twenty words. 
Sentences are long. 


Number of run ons 1 
Percent of run ons 1 


Run ons are acceptable. 
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Number of medium length sentences 28 
Percent of medium length sentences 50 


Modulation is average 
ETYMOLOGY OF WORDS 
Number of Latinate words : G22 


Number of Germanic words ; 2e/ 


Etymology 1S mixed 


DIFFICULTY OF VOCABULARY 


PostGraduate difficulty : 4 
Graduate difficulty : 55 
High School difficulty : 213 
Elementary difficulty : 320 
Percent of Postgraduate difficulty 0 
Percent of Graduate difficulty 5 
Peneemt of High School difficulty 26 


Difficulty is challenging 


GANGESIUITY 
Number of Tangible words : 106 
Number of Intangible words es: 543 


Tangibility is very intangibible. 


EMOTIONAL CONNOTATIONS 


Sublime connotations : il 
Pleasant connotations : 50 
Neutral connotations : B62 
Unpleasant connotations : a2 
Horrid connotations : Gq 


Percent of sublime connotations 
Percent of pleasant connotations 
Percent of unpleasant connotations 
Percent of horrid connotations 


OWLS 


Index of Emotionality 14 
Emotionality 18S average 


Tone is Positive 


Eco 


VIGOR OF WORDS 


Words of Extreme Vigor =: 7 
Words of Much Vigor ‘ 34 
Words of Some Vigor : 30! 
Words of Little Vigor : 307 


Percent of words of extreme vigor 
Percent of words of much vigor 
Percent of words of some vigor : 2 


WWNO 


Index of Vigor GG 


Vigor is lively 


This 1s a Solid piece of writing well within the traditions of its genre. 


You are as able to understand the meaning of the above characteristics as The St 
Vis <. 
tand the course! 


Excerpt from Galapagos by Vonnegut 


PROFILE 
Total of sentences : ee 
Total of words : 542 
Total of letters ‘ 2526 


542 words are enough for valid statistical analysis. 


Of the 5G2 words in Excerpt from Galapagos by Vonnegut 
G49 were matched to words in the Stylist dictionary. 


82 % were matched. 
This is enough for statistical analysis. 


ok 


BREAKDOWN OF PERCENT OF LETTERS PER WORD 


x KX KM K KM KK 
SN KR K KR KK OK 
< 
Oo K KR KK 


6 


x 
x 
X 
x 
Xx 
x 
X 
x 
x 
x 
X 
5 
Letters in a Word 


x 
x 
x x 
X x 
x x 
X x 
X X 
x x 
5 4 
Number of 
The average number of letters per word 2 3. /602ZS55946Et00 
A typical fiction texts distribution 


resembles a tall bell-shaped curve centered around five letters/word. 


The length of the words is long. 


l BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 
Number of words 


100 

98 

96 

94 

92 

90 

88 

36 

84 

82 

80 

78 

76 

74 

ie 

70 

63 

66 

54 

oe 

60 

53 

56 

5G 

52 

50 

48 x 

G6 x x 
GG x x x 
G2 x x x 
40 x x x x 
38 XX xX X Xx 
36 XX XX x x 
39 X AK KK x x 

Size xX XX XxX x x 
SO) xX X Xo XX Xx x 

Zoom x xX XX XxX x x 

Ome, X > a. © 4 x x 
oC Cmex X XX XxX x Xx 
Zemex X XX XxX me. xX 
2OmenK 6OUCXXX OCOKX > i. aap. § 
Mom CXXX CUXXX XK OX COX 
Wome | OXXX XXXX X XX X X 
om AAXX XXXXXXXXXXX X 
12 XXX XXXX XXXXXXXXXXX X 
WM AXXX XXXXXXXXXXXXX 
8 XXXXKXXXK KXXXXXXXKXXXXXX 
6 XXXXXXXXXXXXKXKKXXXXXX 
G XXXXXXXXXXXXXXXXKXXXXXX 
Pe AX KXXXXKXXXXXXXXXX 

10 20 30 G0 50 60 70 
Sentence Number 
The average number of words per sentence > 2.4636363649E+01 


A typical modern texts sentences average between fifteen and twenty words. 
Sentences are too long. 


Number of run ons Z 
Percent of run ons 9 


Run ons are unacceptable 
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Number of medium length sentences 10 
Percent of medium length sentences G5 


Modulation is average 


ETYMOLOGY OF WORDS 
Number of Latinate words : 86 
Number of Germanic words : 184 


EtYmMOLlOSY 1S Very mative. 


DIFFICULTY OF VOCABULARY 


PostGraduate difficulty : 0 
Graduate difficulty : 5 
Hidh- Sscnool Vda fficulty : G9 
Elementary difficulty : 218 
Percent of Postgraduate difficulty 0 
Percent of Graduate difficulty 0 
Percent of High School difficulty 10 


Difficulty is easy. 


TANGIBIEIGY 
Number of Tangible words : 76 
Number of Intangible words es: 194 


Tangibility is tangibible. 


EMOTIONAL CONNOTATIONS 


Sublime connotations 

Pleasant connotations 

Neutral connotations : 1 
Unpleasant connotations 

Horrid connotations 


~~ 
NWOwON 


Percent of sublime connotations 

Percent of pleasant connotations 1 
Percent of unpleasant connotations 

Percent of horrid connotations 


Im KO) SM © 


Index of Emotionality G3 
Emotionality is Rich 


Tone 1s Positive 


VIGOR OF WORDS 


Words of cxtreme Vigor =: 3 
Words of Much Vigor : 39 
Words of Some Vigor : 25 
Words of Little Vigor : oS 


Percent of words of extreme vigor : 0 
Percent of words of much vigor : 8 
Percent of words of some vigor : a7 
Index of Vigor 67 


Vigor is very strong 


RECOMMENDATION NUMBER 1 


You tend to write run-on sentences. 
Check your longest sentences for run ons. 
Break them up into units of single ideas. 
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tJ 


Gr 


Car 
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